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MOLECULES 



The present invention describes the chromosomal location of one or more genes involved 
in obesity. 

In recent years, there has been an increase in the population of patients with what are 
termed "life-style related" diseases, such as obesity and hyperlipidemia Obesity refers to a 
physical state in which the stored fat, constituted mainly of triglycerides, is accumulated 
excessively in the body. A consequence of obesity is an increased risk of arteriosclerosis, 
fatty liver, hypertension, diabetes, coronary heart disease, stroke, gallbladder disease, 
osteoarthritis, respiratory problems and some types of cancer (Visscher & Seidell, Annu Rev 
Public Health (2001); 22(1): 355-75). It has been shown recently that there can also exist a 
strong genetic component underlying the development of obesity. The role of certain genes, 
e.g., agouti, leptin (ob) and leptin receptor (db\ in the regulation of body fat distribution has 
been described (Friedman & Halaas, Nature (Oct 1998); 395(6704): 763-70; Ahima et al, 
Front Neuroendocrinol (Jul 2000); 21(3): 263-307). In humans, mutations in leptin, leptin 
receptor, melanocortin 4- receptor (MC4-R) and peroxisome proliferator-activated receptor 
(PPAR) y2 genes have been shown to be important component in patients with severe obesity 



(Clement et al\ Int J Obes Relat Metab Disord 2000 Mar ;24(3): 391-3, Yeo et al; Nat 
Genet 1998 Oct;20(2): 111-2). 



better understanding of the disease, and an opportunity to develop more specific and effective 
drugs. 

The present invention is based on the discovery of a chromosomal translocation which 
has a predisposing effect on the development of obesity. The translocation exists between 
chromosome 4 and chromosome 15. Specifically, chromosome 4 has been broken at 
cytoband 4q22.3 and chromosome 15 has been broken at cytoband 15q22.2. The outcome of 
the translocation event is that at least two genes are affected in persons having this 
translocation and that persons having this translocation are more susceptible to develop 
obesity. 

Two of the genes affected by the translocation are uncoordinated 5C (UNC5C) 
residing on chromosome 4 and RAR-like orphan receptor alpha 1 (RORocl) residing on 
chromosome 15. 



The identification of other genes involved in the development of obesity will provide a 
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The breakpoint at chromosome 4 occurs between the genes pyruvate dehydrogenase 
A2 (PDHA2) and UNC5C. While neither of these genes are destroyed in the mutation event, 
UNC5C transcription is upregulated in adipocytes. A cDNA encoding UNC5C is set out as 
SEQ ID NO: 9 (with the first nucleotide of the UNC5C coding region accorded position 155), 
and its gene product is set out as SEQ ID NO: 10. 

There is an alternative splice version produced from the UNC5C gene, where the 
exons 1 1 and 1 2 are spliced together from internal sites producing a transcript with a 
truncated open reading frame and a much shorter protein. A sequence containing this splice 
event has been determined and submitted to the GENESEQN database and has accession 
number AAC90914 (Fig. 2 and Fig.8). We confirmed the presence of this alternative 
transcript with the use of PCR on human brain material. A putative "in silico" generated full 
length clone of the alternative transcript is set out as SEQ ID NO: 13 (with the first nucleotide 
of the sUNCSC coding region accorded position 1) and its gene product is set out as SEQ ID 
NO 14. As used herein "sUNCSC" refers to this shorter splice version of UNC5C unless 
otherwise stated. The sequence AAC90914 is a truncated version of sUNCSC. 

The breakpoint at chromosome 15 is within the first intron of RORocl and this 
mutation event destroys the transcription of the RORocl isoform. A cDNA encoding RORal 
is set out as SEQ ID NO: 5 (with the first nucleotide of the RORal coding region accorded 
position 102), and its gene product is set out as SEQ ID NO:6. The translocation results in the 
20 first exon of RORal being spliced into the second exon of UNC5C, resulting in the 

expression of two novel transcripts which are fusions between RORal andUNC5C (long and 
short). These two novel transcripts are expressed in affected patients adipocytes. The first 
657 amino acids are identical for both fusion proteins. 

The first transcript results in the translation of a long fusion protein between RORal - 
UNC5C (termed herein "IRORal-UNCSC"). A cDNA encoding IRORal-UNCSC is set 
out as SEQ ID NO: 1, and its protein fusion product as SEQ ID NO:2. In SEQ ID NO: 1, 
nucleotide positions 1-233 represent that part of the transcript from RORal . Nucleotide 
positions 234-2986 represent that part of the transcript from UNC5C. In SEQ ID NO: 2, 
amino acid positions 1-55 represent that part of the protein fusion product derived from 
RORal DNA. Amino acid positions 56-383 and 403-964 represent that part of the protein 
fusion product derived from UNC5C DNA. The IRORal-UNCSC transcript is produced by 
splicing RORal exonl with UNC5C exon 2-16 together with a fusion specific exon situated 
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between UNC5C exons 7 and 8. The fusion specific exon is an extra exon of 57bp which 
encodes for an extra 19 amino acid residues and resides within the IRORal-UNC5C 
transcript at nucleotide positions 1220-1276 of SEQ ID NO:l. In SEQ ID NO:2, amino acid 
positions 384-402 represent that part of the protein fusion product derived from the fusion 

5 specific exon. It is to be noted that the first coding nucleotide for the amino acid position 384 
comes from UNC5C exon 7. 

The second transcript results in the translation of a shorter fusion protein 
between RORal -UNC5C (known herein as the "sRORal-UNC5C"). A cDNA encoding 
sRORal-UNC5C is set out as SEQ ID NO: 3, and its protein fusion product as SEQ ID NO: 

lo 4. In SEQ ID NO: 3, nucleotide positions 1-233 represent that part of the transcript from 
RORal. Nucleotide positions 234-2546 represent that part of the transcript from UNC5C. In 
SEQ ID NO:4, amino acid positions 1-55 represent that part of the protein fusion product 
derived from RORal DNA. Amino acid positions 56-383 and 403-669 represent that part of 
the protein fusion product derived from sUNCSC DNA. As for the long form, a fusion 

is specific exon is situated between UNC5C exons 7 and 8. The fusion specific exon is an extra 
exon of 57bp which encodes for an extra 19 amino acid residues and resides within the 
sRORocl-UNC5C transcript at nucleotide positions 1220-1276 of SEQ ID NO:3. In SEQ ID 
NO:4, amino acid positions 384-402 represent that part of the protein fusion product derived 
from the fusion specific exon. It is to be noted that the first coding nucleotide for the amino 

20 acid position 384 comes from UNC5C exon 7. The sRORal-UNC5C transcript only differs 
from lRORocl-UNC5C in the way that UNC5C exons 1 1 and 12 are spliced together. 
sRORal-UNCSC uses internal splice sites within UNC5C exon 11 and 12 so that the 3' part 
of exon 1 1 and the 5' part of exon 12 is omitted (Fig. 4 and Tables 1 and 2). Thus, sRORal- 
UNC5C has a destroyed open reading frame resulting in the deletion of 307 ammo acids and 

25 replacing them with 12 amino acids from another reading frame. 

Table 1 below shows DNA sequence alignment of the lRORal-UNC5C (SEQ 
ID NO: 1) and sRORal-UNC5C (SEQ ID NO: 3) fusion transcripts together with 
UNC5C(AF055634) and the alternative spliceform of UNC5C (AAC90914). Also shown are 
the genomic exon sequences of RORal exon 1 and UNC5C exon 1-16: 
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AF055634 
UNC5C_exonl 



AF055634 
UNC5C_exonl 
SEQ ID NO:l 
SEQ ID NO: 3 
• RORAl_exonl 



AF055634 
UNC5C_exonl 
SEQ ID NO:l 
SEQ ID NO: 3 
RORAl_exonl 



AF055634 
UNC5C_exbnl 
SEQ ID NO:l 
SEQ ID NO: 3 
RORAl_exonl 



AF055634 
UNC5C_exonl 
UNC5C_exon2 
SEQ ID NO:l 
SEQ ID NO: 3 
RORAl_exonl 



1 42 
CTGCCTTTGGAGAAAGTGGAGTGTGGCGCTTGGTTGTCGTTAT 
CTGCCTTTGGAGAAAGTGGAGTGTGGCGCTTGGTTGTCGTTAT 

43 102 

TTCTTCGGACTGCTTCGCGGTGCACGGATTCAGCTTCT 

TTCTTCGGACTGCTTCX5CGGTGCACGGATTCAGCTTCTC 

. CTGAAAACAGAAGATAGAGGGJ^TCTCGGAGC TCGCCATCTCCAGCGATC 

CTGAAAACAGAAGATAGAGGGAGTCTCGGAGCTCGCCATCTCC^ 

CTGAAAACAGAAGATAGAGGGAGTCTCGGAGCTra 

103 162 
TTGCGCGTCTCTCTGTCCCCCTCCCCTCCCCCCGGCACACCTCTGTCTACGATGAGGAAA 
TTGCGCGTCTCTCTGTCCCCCTCCCCTCCCCCCGGCACACCTCTC 
GGAAAAAACATGGAGTCAGCTCCGGCAGCCCXra^ 

GGAAAAAACATGGAGTCAGCTCCGGCAGCCCCC6ACCCCGCCGCCAGCGAGCCAGGCAGC 
GGAAAAAACATGGAGTCAGCTCCGGCAGCCCCCGACCCCGCCGCC^GCGAGCCAGGCA^ 

163 222 
GGTCTGCGGGCGACAGCGGCCCGCTGCGGACTGGGACTGGGATACTTGCTGCAAATGCTC 
GGTCTGCGGGCGACAGCGGCCCGCTCCGGACTGGGACTCGGATACTTGCTGCAAATGCTC 
AGCGGCGCGGACGCGGCCGCCGGCTCCAGGGAGACCCCGCTGAACCAGGAATCCGCCCK 
AGCGGCGCGGACGCGGCCGCCGGCTCCAGGGAGACCCCGCTGAACCAGGAATCCGCCCGC 
AGCGGCGCGGACGCGGCCGCCGGCTCCAGGGAGACCCCGCTGAACCAGGAATCCGCCCGC 

223 282 

GTGCTACCTGCCCTGGCC<^CTCAGCGCCAGCGGCACT^ 

GTGCTACCTGCCCTGGCCCTGCTCAGCGCCAGCGGCAGT^ 

ATGAT 

AAGAGCGAGCCGCCTGCCCCGGTGCGCAGACAGAGCTATTC 
AAGAGCGAGCCGCCTGCCCCGGTGCGCAGACAGAGCTATTCa^^ 
AAGAGCGAGCC GCC TGCCCCGGO^GCAGACAGAGCTATTCCAGCACCAGCAGAG 



283 342 
AF055634 GACTTTTTTCATGAACTCCCAGAAACTTTTC 
UNC5C_exon2 GACTTTTTTCATGAACTCCCAGAAACTTTTCCTTOT 
SEQ ID NO:l GACTTTTTTCATGAACTCCCAGAAACTTTTCCTTCTGATC 
SEQ ID NO: 3 GACTTTTTTCATGAACTCCCAGAAACTTTTG^ 

343 .402 
AF05563 4 TTCCTTATTGAGCCTGAAGAAGCTTATATTGTGAAG 
UNC5C_exoil2 TTCCTTATTGAGCCT^AAGAAGCTTATATTGTGAAGAATA^ 

SEQ ID NO: 1 , TTC ATTGAGC CTGAAGAAGCTTATATTGTGAAGAA TACTGT 

SEQ ID NO: 3 TTCCTTATTGAGCCTGAAGAAGCTTATATTGTGAAGAATAAG 

************************************************************ 

403 462 
AF055634 AAAGCAAGCCCTGCCACCCAGATCTATTTCAAGTGTAATAGTGAATC 
UNC5C_exon2 AAAGCAAGCCCTGCCACCCAGATCTATTTCAAGTGTAATAGTGAATGGGTTCAT 
SEQ ID NO:l . AAAGCAAGCCCTGCCACCCAGATCTATTTCAAGTC 

SEQ ID NO: 3 AAAGCAAGCCCTGCCACCCAGATCTATTTCAAGTGTAATAGTGAATGGGTTCATC 

************************************************************ 



AF055634 
DNC5C_exon2 
UNC5C_exon3 
SEQ ID NO: l 
SEQ ID NO: 3 



AF055634 
UNC5C_exon3 
SEQ ID NO:l 
SEQ ID NO: 3 



463 522 

GACCACATAGTAGATGAAAGAGTAGATGAAACTTCCGGTCTCATTGTCCG 

GACCACATAGTAGATGAAAGAGTAGATGAAACTTCCG 

GTCTCATTGTCCGGGAAGTGAGC 
GACCACATAGTAGATGAAAGAGTAGATGAAACTTCCGGTCTCATTGTC 
GACCACATAGTAGATGAAAGAGTAGATGAAACTTCCGGTC 

********* **>-•»> *************************************** ******* 

. ( ; 

523 582 
ATTGAG ATTTCG CGC CAGCAAGTGGAAGAACTCTTTGGACCTGAAGATT ACTGG TG CCAG 
- ATTGAGATTTCG CGCCAGCAAGTGGAAGAACTCTTTGGACCTGAAGATTACTGGTGCCAG 
ATTGAGATTTCGCGCCAGCAAGTGGAAGAACTCTTTG 

ATTGAGATTTCG CGCCAG CAAGTGGAAGAACTCTTTGGAC CTGAAGATTACTG GTGCCAG 
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UNC5C_exon3 
SEQ ID NO:l 
SEQ ID NO: 3 



TGTGTGGCCTGGAGCTCCGCGGGTACCACAAAGAGCCGGAAGGCGTATGTGCGCATTGCA 
TGTGTGGCCTGGAGCTCCGCGGGTAC(^CAAAGAGCCGGAAGGCGTATGTGCGCATTGCA 



643 702 
AFO 5 5 63 4 TATCTACGGAAGACATTTGAGCAGGAACCCCTAGGAAAGGAAGTGTCTTTGG 
UNC5C_exon3 T 

UNC5C_exon4 ATCTACGGAAGACATTTGAGCAGGAACCCCTAGGAAAGGAAGTGTCTTTGGAAC^ 
SEQ ID N0:1 TATCTACGGAAGACATTTGAGCAGGAACCCCTAGGAAAGGAAGTGTCTTTGGAAC^GGAA 
SEQ ID NO: 3 TATCTACGGAAGACATTTGAG(^GGAACCCCTAGGAAAGGAA^ 

703 . 762 
AFO 55 63 4 GTCTTACTCCAGTGTCGACCACCTGAAGGGATCCCAGTGGCTGAGGTC , 

UNC5C_exon4 GTCTTACTCCAGTGTCGACCACCTGAAGGGATCCCAGTGGCTGAG 
UNC5C_exon5 GTGGAATGGTTGAAA 
SEQ ID NO: 1 GTCTTACTCCAGTGTCGACCACCTGAAGGGATCCC^^ 

SEQ ID NO: 3 GTCTTACTCCAGTGTCGACCACCTGAAGGGATCCCAGTGGCTGAGGTGGAATGGTTGAAA 
************************************************************ 

763 822 
AF05563 4 AATGAAGACATAATTGATCCCGTTGAAGATCGGAATTTTTATA 

UNC5C_exon5 AATGAAGACAT AATTGATC CCG TTGAAG ATCG GAATTTTTATATT ACTATTGATCACAAC 
SEQ ID NO: 1 . AATGAAGACATAATTGATCCCGTTGAAGATCGGAATTTTTATATTACTATTGA 

SEQ ID NO: 3 AATGAAGACATAATTGATCCCGTTGAAGATCGGAATTTTT 

823 882 
AFO 55 63 4 CTCATCATAAAGCAGGCCCGACTCTCTGATACTGCAAATTACACCTGTGT^ 
UNC5C_exon5 CTCATCATAAAGCAGGCCCGACTCTC^ATACTGCAAATTACACCTO 
SEQ ID NO: 1 CTCATCATAAAGCAGGCCCGACTCTCTGATACTGCAAATTACACCTGTGTTGCCAAAAAC 
SEQ ID NO: 3 CTCATCATAAAGCAGGCCCGACTCTCTGATACTGCAAAT^ 

883 942 
AFO 5 5 634 ATTGTTGCCAAGAGGAAAAGTACAACTCCCA(^^ 
UNC5C_exoil5 ATTGTTGCCAAGAGGAAAAGTACAACTGCCACTGTCATAGTCTATG 
UNC5C_exon6 TCAACGGTGGCTGG 
SEQ ID NO:l ATTGTTGCCAAGAGGAAAAGTACAACTGCCACTGTCATA 

SEQ ID NO: 3 ATTGTTGCCAAGAGGAAAAGTACAACTGCCACTGTCATAGTCTATGTCAACGGTGGCTGG 
************************************************************ 



AF055634 
UNC5C_exon6 
SEQ ID NO:X 
SEQ ID NO: 3 



943 1002 
TCCACCTGGACGGAGTGGTCTGTGTGTAACAGCCGCTGTGGACGAGGGTATCAGAAACGT 



TCCACCTGGACGGAGTGGTCTGTGTGTAACAGCCGCTGTGGACGAGGGTATCAGAAACGT 
TCCACCTGGACGGAGTGGTCTGTGTGTAACAGCCGCTGTGGACGAGGGTATCAGAAACGT 



1003 1062 
AF05 5 63 4 ACAAGGACTTGTACCAACCCGGCACC^CTCAATGGGGGTGCCTTCTGTGAAGGGCAGAGT 

SEQ ID NO : 1 ACAAGGACTTGTACCAACCCGGCACCACTCAATGGGGGTGCCTTCTGTGAAGGGCAGAGT 
SEQ ID NO: 3 ACAAGGACTTGTACCAACCCGGCACCACTCAATGGGGGTGCCTTCTGTGAAG 

************************************************************ 

1063 1122 
AF055634 GTGCAGAAAATAGCCTGTACTACGTTATG CCCAGTGGATGG CAGGTGGACGCCATGGAGC 

UNC5C_exon6 GTGCAGAAAATAGCCTGTACTACGTTATGCCCAG 

UNC5C_exon7 TGGATGGCAGGTGGACGCCATGGAGC 
SEQ ID NO:l GTGCAGAAAATAGCCTGTACTACGTTATG CCCAGTGGATGG CAGGTGGACGCCATGGAGC 

SEQ ID NO: 3 GTGCAGAAAATAGCCTGTACTACGTTATG CCCAG TGGATG GC AGG TGGACG CCATGGAGC 

************************************************************ 



1123 1182 
AF055634 AAGTGGTCTACTTGTGGAACTGAGTGCACCCACTK3GCGCAG 
UNC5C_exon7 AAGTGGTCTACTTCTGGAACTGAGTGCACCCACT^ 
SEQ ID NO:l AAGTGGTCTACTTGTGGAACTGAGTGCACCCACTGGCGC^ 
SEQ ID NO: 3 AAGTGGTCTACT/TCTGGAACTGAGTGCACCCACTGGCGCA 
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AF055634 GCCCCCAAGAATGGAGGCAAGGACTGCGACGGCCTCGTCTTGCAATCCAAGAACTGCACT 

UNC5C_exon7 GCCCCCAAGAATGGAGGCAAGGACTGCGACGGCCTCGTCTTGCAATCCAAGAACTGCACT 

SEQ ID NO: 1 . GCCCCCAAGAATGGAGGCAAGGACTGCGACGGCCTCGTCTTG 

SEQ ID NO: 3 GCCCCCAAGAATGGAGGCAAGGAC1K3CGACGGCCTCGTCTTGCAATC 



1243 

GATGGGCTTTGCATGCAGA- 
GATGGGCTTTGCATGCAGA 



1302 

AF055634 
UNC5C_exon7 

Fusion spec exon gtttcatttatcccatttcaactgaacagagaacccagaat 

SEQ ID NO : 1 GATGGGCTTTGCATGCAGAGTTTCATTTATC 

SEQ ID NO: 3 GATGGGCTTTGCATGCAGAGTTTCATTTATCCCA!^^ 



1303 1362 

AF0 55634 CTGCTCCTGATTCAGATGATGTTGCTCTCTATGTTGGGATTGTG 

Fusion spec exon GAATATGGATTTTCTT 

UNC5C_exon8 CTGCTCCTGATTCAGATGATGTTGCTCTCTATG^ 
SEQ IDNO:l GAATATGGATTTTCTOCTGCTCCTGATTC^^ 
SEQ ID NO: 3 



1363 1422 
AF055634 ATAGCAGTGATCGTTTGCCTGGCGATCTCTGTAGTTC 
UNC5C_exon8 ATAGCAGTGATCGTTTGCCTGGCGATCTCTGTAGTTGTGGCCTTGTTTG 
SEQ ID NO : 1 ATAGCAGTGATCGTTTGCCTGGCGATCTCTGTAGTTGTC 
SEQ ID NO: 3 ATAGCAGTGATCGTTTGCCTGGCGATCTCTGTAGTTGTGGC^ 

AAC90914 TTGTTTGTGTATCGGAAG 
************************************************************ 

. 1423 . 1482 

AF055634 AATCATCG TGACTTTGAG TCAGATATTATTGACTCTTCGGCACTC 

UNC5C_exon8 AATCATCG TG ACTTTGAGTCAGATATTATTGACTCTTCGGCACTCAATGG^ 

SEQ ID NO:l AATGA.TCX3TGACTTTGAGTCAGATATTATTGACT 

SEQ ID NO: 3 AATCATCGTGACTTTGAGTCAGATATTATTGACTC 

AAC90914 AATCATCGTGACTTTGAGTCAGATATTAT^ 

1483 1542 
AF05563 4 CCTGTGAACATCAAGGCAGCAAGACAAGATCTCCTGGCTOTA 
UNC5C_exon8 CCTGTGAACATCAAGGCAGCAAGACAAG 

UNC5C_exon9 - ATCTGCTGGCTGTACCCCCAGACCTCACGTCA 

SEQ ID NO:l CCTGTGAACATCAAGGC&GCAAGACAAGATCTGCTGGCTC 
SEQ ID NO: 3 CCTGTGAACATCAAGGCAGCAAGACAAGATOTGCTGG 
AAC90914 CCTGTGAACATC^GGCAGCAAGACAAGATCTGCTGGCTC^ 

************************************************************ 

1543 1602 
AF0 55634 GC^CAGCCATGTACAGAGGACCTGTCTATGCCCTC 
UNC5C_exon9 GCTGCAGCCATGTACAGAGGACCTGTCTATGCCCTGCATGACGTC 
SEQ ID NO:l GCTGCAGCCATGTACAGAGGACCTGTCTATGCCCTGCATGACGTCTCA 
SEQ ID NO: 3 GCTGCAGCCATGTACA.GAGGACCTGTCTATGCCCTGC^ 

AAC90914 GCTGCAG CCATG TACAGAGGACCTGTCT ATGCCCTGCATGACG TCTCAGACAAAATCCCA 



1603 1662 
AF055634 ATGACCAACTCTCCAATTCTGGATX^ 
UNC5C_exon9 ATGACCAACTCTCCAATTCTGGATCCACTC 
SEQ ID NO:l ATGACCAACTCTCCAATTCTGGATCCACTGCCCAACCTGA 
SEQ ID NO: 3 ATGACCAACTCTCCAATTCTGGATCCACTC 
AAC90914 ATGACCAACTCTCCAATTCTGGATCCACTGCCCAA 



1663 1722 
AF0 55634 TCAGGTGCTGTCTCCCCCCAAGATGACCTCTCTGAGTT/TACGTCCAAGCTG 
UNC5C_exonS TCAGGTGCTGTCACCCCCCAAGATGACCTCTCTGAGTT^ 
SEQ ID NO:l TCAGGTGCTGTCACCCCCCAAGATGACCTCTC^ 
SEQ ID NO: 3 TCAGGTGCTGTCACCCCO^GATGACCTCTCTGAGTTTACGTCC^ 
AAC90914 TCAAGTGCTGTCACCCCCCAAGATGACCT 

*** ******** *********************************************** 



AF055634 



1723 1782 
ATGACXCAGTCGTTGTTGGAGAATGAAGCCCTC 
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UKC 5C_exon9 ATGACCCAGTCGTTGrTCGAGAATGAAGCCCTCAGCCTGAAGAACCAGAGTCTAGCAAGG 

SEQ ID NO: 1 ATGACCCAGTOSTTGTTGGAGAATGAAGCCCTCAGCCTGAAGAACCAGAGTCTAGCAA 

SEQ ID NO: 3 . ATGACCCAGTCGTTGTTGGAGAATGAAGCCCTCA 
AAC9 09 14 ATGACCCAGTCGTTGTTGGAGAATGAAGCCCTCAGCCTGAAGAACC^ 

1783 1842 
AFO 55634 CAGACTGATCCATCCTGTACCGCATTTGGCAGCTTCAACTC 
UNC5C_exon9 CAGACTGATCCATCCTGTACCGCATTTGGCAGCTTCAACTCGCTGGGAGG 
SEQ ID NO:l CAGACTGATCCATCCTGTACCGCATTTGGCAGCTTCAACTC^ 
SEQ ID NO: 3 CAGACTGATCCATCCTGTACCGCATTTGGCAGCTTCAACTCACTG 
AAC9 0914 CAGACTGATCCATCCTGTACCGCATTTGGCAGCTTCAACT^ 

♦*******★***********************+******** ****************** 

1843 1902 

AFO 55634 GTTCCCAATTCAGGAGTCAGCTTGCTGATTC^ 

UNC5C_exon9 GTTCCCAATTCAG 

UNC5C_exonlO GAGTCAGCTTGCTGATTCCCGCTGGGGCCATTCCCCAAGGGAGAGTC 

SEQ ID NO: 1 GTTCCCAATTCAGGAGTCAGCTTGCTGATTCCCGCTGGGGCCATTC 

SEQ ID NO: 3 GTTCCCAATTCAGGAGTCAGCTTGCTGATTCCCGC 

AAC9 0914 GTTCCCAATTCAGGAGTCAGCTTGCTGATTCCCGCTGGGGCCATTCCCCAA 

1903 • 1962 

AF055 634 TACGAAATGTATGTGACTGTACACAGGAAAGAAACTATGAGGCCACCCATGGATGACTCT 
UNC5C_exonlO TACX3AAATGTATGTGACTGTACACAGGAAAGAAACTATGAG 

UNC5C_exonll GCCACCCATGGATGACTCT 
SEQ ID NO : 1 TACGAAATGTATGTGACTGTACAC^GGAAAGAAACTATGA 

SEQ ID NO: 3 TACG7LAATGTATGTGACTGTACACAGG AAAGAAACTATGAG G C CAC CCATGG ATGACTCT 

AAC90914. TACGAAATGTATGTGACTGTACACAGGAAAGAAACTATGAGGCCACCCATGGATGACTCT 
************************************************************ 

1963 • 2022 

UNC5C_exonll CAGACACTTTTGACCCCTGTGGTGAGCTGTGGG^ 
SEQ ID NO : 1 CAGACACTTTTGACCCCTGTGGTGAGCTGT^ 
SEQ ID NO: 3 CAGACACTTTTGACCCCTGTGGTGAGCTGTGGGCCCC 
AAC90914 CAGACACTTTTGACCCCTGTGGTGAGCTGTGGGCCCCC^ 

*************************** ************* 

2023 2082 
AF055 63 4 GTCGTCCTCAOTATGCATCACTGCGCAGACCCCAATACCGAGGACTGGAAAATA 
UNC5C_exonl 1 GTCGTCCTCACTATGCATCACTGCGCAGACCCCAATACO^GGACTC 
SEQ ID NO : 1 GTCGTCCTCACTATGCATCAC^CGCAGACCCCAATACCGAGGACTGGAAAATACTGCTC 
SEQ ID NO: 3 GTCGTCCTCACTATGCATCACTGCGCAGACCCCAATACCGAGGACTGGAAAATACTGCTC 
AAC9 0914 GTCGTCCTCACTATGCATCACTGCGCAGACCCCAATACCGAGGACTGGAAAATACTGCTC 

a*********************************************************** 



AP055634 
UNC5C_exonll 
DNC5C_exonl2 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



2083 2142 

AAGAACCAGGCAGCACAGGGACAGTGGGAGGATGTGGTGGTGGTCGGGGAGGA^ 

AAGAACCAGGCAGCACAGGGACAGTGGGAG 

GATGTGGTGGTGGTCGGGGAGGAAAACTTC 
AAGAACCAGGCAGCACAGGGACAGTGGGAGGATGTGGTGGTGGTCGGGGAGGAAAACTTC 



AF055634 
UNC5C_exonl2 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



2143 2202 
AOACCCCCTGCTACATTAAGCTGGATGCAGAGGC 
ACCACCCCCTCCTACATTCAGCTGGATGCAGAGGCCTGCCACATCCTC^ 
ACCACCCCCTGCTACATTCAGCTGGATGCAGAGGCCTGCCACATCCTCAC^ 



AF055634 
UNC5C_exonl2 
SEQ ID NO:l 
SEQ ID NO: 3 
AAC90914 



2203 2262 
AGCACCTACGCCCTGGTAGGACATTCCACCACCAA 

AGCACCTACGCCCTGGTAGGACATTCCACCACCAAAGCGGCTGCGAAGCGCCTCJVAGC^ 
AGCACCTACGCCCTGGTAGGACATTCCACCACCAAAGCGGCTGCA7^CGCCTCAAGCTC 



2263 



2322 
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AF055634 GCCATCTTTGGGCCCCTGTGCTGCTCCTCGCTGGAGTACAGCATCCGAGTCT^ 

UNC5C_exonl2 GCCATCTTTGGGCCCCTGTGCTrGCTCCTCGOTGGAGTACAGCATCCGAGTC^ 

SEQ ID NO:l GCCATCTTTGGGCCCCTGTGCTGCTCCTTCGCTGGAG TACAGCATCCGAGTCTACTGTCTG 

SEQ ID NO: 3 CTOSCTGGAGTACAGCATCCGAGTCTACTGTCTG 

AAC90914 CTCG CTGGAGTACAG CATCCGAGTCTACTGTCTG 



2323 ... 2382 

AF055634 GATGACACCCAGGATGCCCTGAAGX3AAATTTTACATCTTGAGA 

UNC5C_exonl2 GATGACACCCAGGATGCCCTGAAG 

UNC5C_exonl3 GAAATTTTACATCTTGAGAGACAGACGGGAGGACAG 

SEQ ID NO:l G ATGACACCCAGGATGC CCTGAAGGAAATTTTACATCTTGAG AGA(^AG ACGGGAGG^ 

SEQ ID NO: 3 GATGACACCC AGGATGCCCTGAAGGAAATTTTACATCT^ 

AAC 90914 GATGACACCCAGGATGCCCTGAAGGAAATTTTACATCTTGAGAGACAGACG 



2383 2442 
AF055634 • CTCCTAGAAGAACCTAAGGCTCTTCATTTTAAAGG^ 
UNC 5 C_exon 1 3 CTCCTAGAAGAACCTAAGGCTCTTCATTTTAAAGGCAG 
SEQ ID NO: 1 CTCCTAGAAGAACCTAAGGCTCTTC^^ 
SEQ ID .NO: 3 CTCCTAGAAGAACCTAAGGCTCTTCATTTTAAAGGCAGCA 
AAC90914 CTCCTAGAAGAACCTAAGGCTCTTCATTTTAAAGGCAGCA 

2443 2502 

AF0 55634 ATTCACGATATCGCCC^TTCCCTCTGGAAGAGCAAATTGCTGGCTAAA 

UNC5C_exonl3 ATTCACGATATCGCCCATTCC^TCTGGAAGAG CAAATTGCTGGCTAAATATCAG 
UNC5C_exonl4 ' GAAATT 

SEQ ID NO:l ATTCACGATATCGCCCATTCCCTCTGGAAGAGCAAATTGCTGGCTAAA 

SEQ ID NO: 3 ATTCACGATATCGCCCATTCCX^TCTGGAAGAGCAAATTGCTGGCTA 

AAC 9 0914 ATTCA.CGATATCGCCCATTCCCTCTGGAAGAGCAAATTGCTC 

2503 2562 
AP055634 CCATTTTACCATGTTTGGAGTGGATCTCA 
UNC 5C_exonl 4 CCATTTTACCA.TGTTTGGAGTGGATCTCAAAGAAA 
SEQ ID NO: 1 CCATTTTACCATGTTTGGAGTGGATCTCAAAGAAACCTC 
SEQ ID NO: 3 CCATTTTACCATGTTTGGAGTGGATCTCAAAGAAAC 
AAC90914 CCA.TTTTACCATGTTTGGAGTGGATCTCAAAGAAACCTC 

2563 2622 
AF0 5 5 6 3 4 AGATTTAGCCTGAACACAGTGG AGCTGGTTTGCAAACTCTGTGTGCGGCAGGTGGAAGGA 

UNC5C_exonl4 . AGATTTAGCCTGAACACAGTGGAGCTGGTTTC 

SEQ ID NO: 1 AGATTTAGCCTGAACACAG TGGAGCTG<5TTTG CAAACTC 

SEQ ID NO: 3 AGATTTAGCCTGAACACAGTGGAGCTGGTTTGCAAACTCTGTC 
AAC9 0914 ■ AGATTTAGCCTGAACACAGTGGAGCTGGTTTGCAAAC 



2623 2682 

AF055634 . GAAGGGCAGATCTTCCAG CTCAACTGCACCGTG TCAG AGGAACCTACTGGCATC GATTTG 

UNC5C_exonl4 GAAGGGCAGATCTTCCAG CTCAACTG CACCGTGTCAGAG 

UNC5C_exonl 5 G AAC CTACTGG CATCG ATTTG 

SEQ ID NO:l GAAGGGCAGATCTTCCAGCTCAACK3CACCGTGTCAGAG^ 

SEQ ID NO: 3 GAAGGGCAGATCTTCCAGCTCAACTGCAC CGTGTCAGAGGAACCTACTG GCATCGATTTG 

AAC90914 GAAGGG CAGATCTTCCAG CTCAACTGCACCGTGTCAGAGGAACCTACTG GCATC 



2683 2742 
AF0 55634 CCGCTGCTGGATCC^CGAACACCATCACCACGGTCACXSGGG^CCAG 
DNC5C_exonl 5 CCGCTGCTGGATCCTGCGAACACCATCACCACGGTCACGGGGC^ 
SEQ ID NO:l CCGCTGCTGGATCCTGCGAACACCATCACCAC^ 
SEQ ID NO: 3 CCGCTGCTGGATCCTGCGAACACCATCACCACGGTCAC 
AAC 9 09 14 CCGCTCCTCGATCCTGCGAACACCATCAC 

**★***★***★#★*#****★************************+*******■******** 



2743 2802 
AF 0 5 5 63 4 CCTCTCCCTATCCGGC^GAAGCTCTGTAGCAGCCTGGATGCCCCCCAGACGAGAG 
DNC5C_exonl5 CCTCTCCCTATCCGGCAGAAGCTCTGTAGCAGCCTGGATGC^ 
SEQ ID NO:l CX^TCTCCCTATCCGGCAGAAGCTCTGTAGCAGCCrGGATG 
SEQ ID NO: 3 CCTCTCCCTATCCGGCAGAAGCTCTGTAGCAGCCTGGATGC^ 
AAC 9 0 9 14 CXyrcTCCCTATCCGGCAGAAGCTCTGTAGCAGCCTGGATGCCCCCCAG^ 
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AF055634 
DNC5C_exonl5 
UNC5C_exonl6 
SEQ ID NO:l 
SEQ ID NO:3 
AAC90914 . 



AF055634 
UNC5C_exonl6 
SEQ ID N0:1 
SEQ ID NO: 3 
AAC90914 



AF055634 
UNC5C_exonl6 
SEQ ID.NO:l 
SEQ ID NO: 3 
AAC90914 



AF055634 
UNC5C_exonl6 
SEQ ID NO:l 
SEQ ID NO:3 
AAC90914 



AF055634 
UNC5C_exonl6 



AF055634 
UNC5C_exonl6 



AF055634 
UNC5C_exonl6 



2803 2862 

GACTGGAGGATGCTGGCCCATAAGCTGAACCTGGACAGGTACTTGAATTACTT^ 

GACTGGAGGATGCTGGCCCATAAGCTGAACCTGGACAG 

. GTACTTGAATTACTTTGCCACC 
GACTGGAGGATGCTGGCCCATAAGCTGAACCrcGACAGGTACrT^ 
GACTGGAGGATGCTG G CCCATAAGCTGAACCTGGACAGGTACTTGAATTACTTTG CCAC C 
GACTGGAGGLA/TCCTGGCCCATAAGCTGAACCT 

2863 2922 

AAATCCAGCCCAACTGGCGTAATCCTGGATCTTTGGGAAGCACAGAACTTCCCAGATG^ 

AAATCCAGCCCAACTGGCGTAATCCTGGATCTTTGGGAAGCACAGAACTTCCCAGATC 

AAATCCAGCCCAACTGGCGTAATCCTGGATCTTTGGGAAGCACAGAACTTCCCAGATGGA 

AAATCCAGCCGAACTGGCGT AATCCTGGATCTTTGGGAAG CACAGAACTTCC CAGATGGA 

AAATCCAGCCCAACTGGCGTAATCCrrcGATCTTTGGGAAGCACAGAACITCC 

************************************************************ 

2923 2982 

AACCTGAGCATGCTGGCAGCTGTCTTGGAAGAAATGGGAAGACATGAA^ 

AACCTGAGCATGCTGGCAGCTGTCTTGGAAGAAATGGGAAGACATGAAACGGTGGTGTCC 

AACCTGAGCATGCTGGCAGCTGTCTTGGAAGAAATGGGAAGACATGAAACGGTGGTGTCC 

AACCTGAG CATGCTGGCAGCTGTCTTGGAAGAAATGGGAAGACATGAAACGGTGGTGTCC 

AACCTGAGCATGCTGGCAGCTGTCTTGGAAGAAATGGGAAGACATGAAACGGTGGTGTCC 

************************************************************ 

2983 3003 3028 3042 

TTAGCAGCAGAAGGG CAGTATTAACCACCATGCTGG AAGGGG AAATG CAGT ATTAACCAC 
TTAGCAGCAGAAGGGCAGTATTAACCACCATGCTGGAAGGGGAAATGCAGTATTAACCAC 

TTAGCAGCAGAAGGGCACTATTAACCACCATGCTGGAAGGGGAAAT 

TTAGCAGCAGAAGGGCAGTATTAACCACCATGCTGGAAGGGGAAAT 

TTAGCAGCAGAAGGGCAGTAT 

************************************************************ 

3043 3102 
CATGCTGGAAGGGGAAATGAAGGACAAAAATGCACAGGGAGTCTGTGGC 
CATGCTGGAAGGGGAAATGAAGGACAAAAATGCACAGGGAGTCTGTGGCCGTCCAGGTGA 
************************************************************ 

3103 3162 
ATCACAGCTGAGGAGGAAATCCAGATGAGACCAATGCACTTCACAGGCAAGAATGCAGCA 
ATCACAGCTGAGGAGGAAATCCAGATGAGACCAATGCACTTCACAGGCAAGAC 
********************* ********************* ********** ******* 

3163 3222 
GGAGCCAGAAGGAAAACAGATACAACTGCCCATGTACATGCCCACTTTACTC 
GGAGCCAGAAGGAAAACAGATACAACTGCCCATGTACATGCCCAC 
******************************************************* **** 



3223 3282 
AF055634 TCACGGGAGTTAAGAAAAATTGTGTAAATTTGTACCTTGAATTT^ 
DNC5C_exonl 6 TCACGGGAGTTAAGAAAAATTGTGTAAATTTGTACCTTGAATTTA^ 

************************************************************ 

3283 3342 
AF055634 TTCCTCTTAGTTGGGCTCTATGCTGTGTGGTACAGGA 
UNC5C_exonl 6 TTCCTCTTAGTTGGGCTGTATGCTGTGTGGTACAGGATCTTAC^ 

3343 3402 
AF055634 TTTTTATTGCTATCCAGATATATGGATAAACTTTCTTAACAA^ 
UNC5C_exonl6 TTTTTATTGCTATCCAGATATATGGATAAACTTTCTTAAC 

************************************************************ 



3403 3462 
AF055634 TTGTTTAC^TCAAATTGGACAGGGATGCAGACACTGTCCATGGCTCG 
UNC5C_exonl 6 TTGTTTACATCAAATTGGACAGGGATGCAGACAC 

************************************************************ 



' AF055634 



3463 3522 
CAAATCATTTGAAGTTGAAGCTG TG GACGG TTTGTTGTGTOT ATTTCAGATT AG TAATTT 



WO 2004/083371 



PCT/GB2004/001124 



-10- 



UNC5C_exonl6 



CAAATCATTTGAAGTTGAAGCTGTGGACGGTTTGTTGTC 

************************************************************ 



3523 3582 
AFO 5 5 63 4 . ACAG AGAAATCACAGACTTTTG CTACAAATCG TG TGCATC AAGTGTC TCAGAT AATC CTC 

UNC5C_exonl6 ACAGAGAAATCACAGACTTTTGCTACAAATCGTGTGCATCAAGTGTC 

3583 3642 
AFO 55634 CCATCAGTGTTCTGTTTCTAGAACTTGTAGAACCAGTGTTACTGTTTC 
UNC5C_exonl 6 CCATCAGTGTTCTGTTTCTAGAACrTCTAGAACC^G 

3643 3702 
AFO 55634 GGAGAATCTAAGTGTAAAAAAGAAATAACTAAGACTCCTATTCCTTGGAGGGACCCTTCT 
UNC5C_exonl 6 GGAGAATCTAAGTGTAAAAAAGAAATAACTAAGAOTCCTATTCCTTGGAGGGACCCTTCT 



3703 3732 
AFO 5 5 63 4 GGTGCCCTTTGGGAATAAAGCTGTAGCACTGC 
UNC5C_exonl6 GGTGCCCTTTGGGAATAAAGCTGTAGCACTGC 

******************************** 



Table 2 below shows a protein alignment of the IRORal-UNCSC (SEQ ID NO: 2) and 
sROR<Xl-UNC5C (SEQ ID NO: 4) fusion proteins with UNC5C: 

Table 2. 

1 ■ 59 



UNC5C exon 1 MRKGLRATAARCGLGLGYLLQMLVLPALALLSASGTGSAAQ 

UNC5C MRKGLRATAARCGLGLG YLLQMLVL PAL ALL S ASG TG S AAQDDDF 

SEQ ID NO: 2 MESAPAAPDPAASKPGSSGADAAAGSRETPLNQESARKSEPPAPVRRQSYSSTSRDDDF 
SEQ ID NO: 4 MESAPAAPDPAASEPGSSQADAAAQSRETPI*NQESARKSEPPAPVRRQSYSSTSRDDDF 
RORal exon 1 MESAPAAPDPAASEPGSSGADAAAQSRETPLNQESARKSEPPAPVRRQSYSSTSR^^ 

60 119 

UNC5C FHELPETFP SDPPEPLPHFL2EPEEAY IVKNKPVNLYCKAS PATQIYFKCNSEWVHQKDH 

SEQ ID NO: 2 FHELPETFP SDPPEPLPHFLIEPEEAYXVKNKPVNLYCKAS PATQIYFKCNSEWVHQKDH 

SEQ ID NO: 4 FHELPETFPSDPPEPLPHFL I EPEEAYIVKNKP VNLYCKASPATQIYFKCNSEWVHQKDH 



120 _ 179 
SEQ 5 ID NO: 2 iTOERVDETSGLIVREVSIEISRC^^ELFGPEDYVrcO^^ 

SEQ ID NO: 4 IVDERVDETSGL IVREVS I E I SRQQVEELFG PEDYWCQCV AWS S AGTTKSRKAYVRIAYL 

************************************************************ 

180 239 
UNC5C RKTFEQEPLGKEVSLEQEVLLQCRPPEGI PVAEVEWLKNEDI IDFVEDRNFYITIDHNLI 

SEQ ID NO: 2 RKTFEQEPIiGKEVSI*EQEVLLQCRPPEGI PVAEVEWLKNEDI I DPVEDRNFYI TI DHNLI 

SEQ ID NO: 4 RKTFEQEPIiGKEVSLEQEVLLQCRPPEGIPVAEVEWLK^^ 

************************************************************ 

240 299 
UNC5C IJCQAPXSE)TANYTCVAKNIVAKPJCSTTATVIVYVN 

SEQ ID NO: 2 IKQARLSDTANYTCVAKNI VAKRKSTTATVTVYVNGGWSTWTEWS^ QKRTR 

SEQ ID NO: 4 IKQARLSE^ANYTCVAKNIVAKl^STTATVIVYVNGGWSTWTF^ 



300 359 
X3NC5C TCTNPAPLNGGAFCEGQSVQKIACTTLCPVDGRWTPWSKWSTCGTECT}^ 
SEQ ID NO: 2 TCTNPAPI^GGAFCEGQSVQKIACTTLCPVDGRWPWSKWSTCGTECTHWRRRECTAPAP . 

SEQ ID NO: 4 TCTNPAPLNGGAFCEGQSVQKIACTTIjCPVDGRWTPWSKWSTCGTEC^ 

************************************************************ 

360 ^19 

UNC5C KNGGKDCDGLVLQSKNCTDGLCMQ TAPDS DDVAL YVGIVIA 

SEQ ID NO: 2 KNGGKDCDGLVLQSKNCTDGLCMQSFIYPISTEQRTQNEYGFSSAPDSDDVAL YVGIVIA . 

SEQ ID NO: 4 KNGGKIX;E)GLVLQS^CTDGLCMQSFIYPISTEQRTQNEYGFSSAPDSDDVAL YVGIVIA 
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Fusion specific exon 



UNC5C 

SEQ ID NO: 2 
SEQ ID NO: 4 



UNC5C 

SEQ ID NO: 2 
. SEQ ID NO: 4 



UNC5C 

SEQ ID NO: 2 
SEQ ID NO: 4 



UNC5C 

SEQ ID NO: 2 
SEQ ID NO: 4 



DNC5C 

SEQ ID NO: 2 
SEQ ID NO: 4 



UNC5C 

SEQ ID NO: 2 



UNC5C 

SEQ ID NO: 2 



DNC5C 

SEQ ID NO: 2 



DNC5C 

SEQ ID NO: 2 



SF IYP ISTEQRTQNEYGFS 
************************ **************** 

420 479 
VIVCLAISVWALFVYRKNHRDFESDIIDSSAIjNGGFQPVNIKAARQDLLAWPDLTS^ 
VIVCIAISVWALFVYRKNHRDFESDIIDSSALNGGFQPVNIKAARC3DLLAVPPDLTS 
VI VCLAI S VVVAIjFVYRKNHRDFESDI I DS SALNGGFQPVNIKAARQDLLAVPPDIiTS AA 
************************************************************ 

480 539 
AMYRGPVYALHDVSDKJPimJSPILDPLPmKIKVYOTSGAVSP^DDLSEFTSKLSPQ^ 
AMyRGPVYALHDVSDKIPMTNSPILDPLPNLKIKVYNTSGAVTPQDDLSEFTSKLSPQMT 
AMYRGPVYALHDVSDKIPMTNSPILDPLPNLKIKVYN^ 

540 599 
QSLLENEALSLKNQSLARQTDPSCTAFG SFNSLGGHL IVPNSGVSLL IPAGAI PQGRVYE 
QSLLENEALSLKNQSLARQTDPSCTAFGSFNSLGGHLIVPNSGVSLLIPAGAIPOGRVYE 
QSLLENEALSLKNQSLARQTDPSCTAFGSFNSLGGHLrVPNSGVSLLIPAGAIPO^RVYE 
************************************************************ 

600 659 

MYVTVHRKETMRPPMDDSQTLLTPWSCGPPGALLTRFVVL 

JbTY\nVHRKJETMRPPMDDSQTL«L^ 

MVVTN^RKETI^PMDDSOjrLIjTPWSCGPPGAIjLTRPV^ 

660 719 

QAAQGQWEDVVWGEENFTTPCTIKLDAEACHILTENLSTYALV^ 

QAAQ^QWEDVVWGEENFTTPCTIQLDAEACHILTENI^ 

GVQHPSLLSG 

720 779 
FGPLCC S SLEY S IRVYCLDDTQDALKEILHLERQTGGQIJjEEPKAIjPIFKG STHNLRLS IH 
FGPLCCSSLEYSIRVYCLDDTQDAIiKEILHLERQTGGQLLEEPK^ 

780 839 

DIAHSLWKSIQjLAKyQEIPFYHVWSGSQRNLHCTFTLERFSLNTVELV 

DIAHSLWKSKLLAKyQEIPFYHVWSGSQRNLHCTFTLERFSLNTVELV 

840 899 
QIFQIiNCTVSEEPTGIDLPLLDPANTITTVTGPSAFSIPIiPIRQKLCSSLDAPQTRGHDW 
QIFQLNCTVSEEPTG IDLPLLDPANTITTVTGPSAFSI PLPIRQ^LC S SLDAPQTRGHDW 

900 959 
RMIJ^KLNLDRYliNYFATKS SPTGVILDLWEAQNFPDGNLSMLAAVLEEMGRHETWSLA 
RMIoAHKLNI^RYI^TyFATKS SPTGVILDLWEAQNFPDGNIj SMLAAVLEEtyGRHETVVSLA 



DNC5C 

SEQ ID NO: 2 



960 

AEGQY 

AEGQY 



As used herein "RORocl -UNC5C" refers to both the short and large fusion protein, 
unless otherwise stated. 

The genomic DNA sequence at the fusion site between chromosome 4 (nucleotide 
5 positions 1-3221) and chromosome 15 (nucleotide positions 3229-6277) isolated from the 
translocation 4:15 is set put as SEQ ID NO:ll. Nucleotide positions 3222-3228 of SEQ ID 
NO:l 1 are a 7bp insertion created in the translocation event The localisation of this sequence 
is shown in Fig. 4. 

The genomic DNA sequence at the fusion site between chromosome 15 (nucleotide 
10 positions 1-998) and chromosome 4 (nucleotide positions 1036-1829) isolated from the 
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translocation 15:4 is set out as SEQ ID NO:12. Nucleotide positions 999-1035 of SEQ ID 
NO: 12 are a 36bp insertion created in the translocation event The localisation of this 
sequence is shown in Fig. 5. 

A cDNA encoding a novel isoform of RORoc, designated "RORa5", is set out as SEQ 
5 ID NO: 7 (with the first nucleotide of the RORocS coding region accorded position 188), and 
its gene product is set out at SEQ ID NO: 8. 

According to one aspect of the present invention there is provided a polypeptide 
selected from the group consisting of: 

i) a fusion peptide of RORal - UNC5C polypeptide comprising SEQ ID NO:2 
io or a fragment thereof of at least 10 amino acids wherein the fragment 

comprises either the junction of the fusion protein, the flanking regions of 
the fusion specific exon or the fusion specific exon of SEQ ID NO:2; 

ii) a fusion polypeptide of RORal - UNC5C polypeptide comprising SEQ ID 
NO:4 or a fragment thereof of at least 10 amino acids wherein the fragment 

15 comprises either the junction of the fusion protein, the flanking regions of 

the fusion specific exon or the fusion specific exon of SEQ ID:4. 

iii) a polypeptide of RORoc5 polypeptide comprising the amino acid sequence of 
SEQ ID NO:8 or a fragment thereof of at least 1 0 amino acids 

or an amino acid sequence at least 90%, preferably 95%, 98% or 99% identical to 
20 either i) or ii) or iii) or a complement thereof. 

In a preferred embodiment, a conservative analojgue of the protein is contemplated. 
In this specification conservative amino acid analogues of specific amino acid 
sequences are contemplated which retain the relevant biological properties of a component of 
the invention but differ in sequence by one or more conservative amino acid substitutions, 
25 deletions or additions. However the specifically listed amino acid sequences are preferred. 
Typical conservative amino acid substitutions are tabulated below. 
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CONSERVATIVE SUBSTITUTIONS 



Original 


Exemplary Substitutions 


Preferred Substitutions 


Ala (A) 


Val; Leu; He 


Val 


Arg(R) 


Lys; Gin; Asn 


Lys 


Asn (N) 


Gin; His; Lys; Arg 


Gin 


Asp(D) 


Glu 


Glu 


Cys(C) 


Ser 


Ser 


Gln(Q) 


Asn 


Asn 


Glu (E) 


Asp 


Asp 


Gly(G) 


Pro 


Pro 


His (H) 


Asn; Gin; Lys; Arg 


Arg 


He(D 


Leu; Val; Met; Ala; Phe; Norleucine 


Leu 


Leu(L) 


Norleucine; He; Val; Met; Ala; Phe 


lie 


Lys(K) 


Arg; Gin; Asn 


Arg 


Met(M) 


Leu; Phe; He 


Leu 


Phe(F) 


Leu; Val; lie; Ala 


Leu 


Pro(P) 


Gly 


Gly 


Ser(S) 


Thr 


Thr 


Thr(T) 


Ser 


Ser 


Trp(W) 


Tyr 


Tyr 


Tyr(Y) 


Trp; Phe; Thr; Ser 


Phe 


Val(V) 


He; Leu; Met; Phe; Ala; Norleucine 


Leu 



In this specification nucleic acid variations (deletions, substitutions and additions) of 
5 specific nucleic acid sequences are contemplated which retain which the ability to hybridise 
under stringent conditions to the specific sequence in question. Stringent conditions are 
defined as 6xSSC, 0.1% SDS at 60° C for 5 minutes. However specifically listed nucleic acid 
sequences are preferred. It is contemplated that peptide nucleic acid may be an acceptable 
equivalent of polynucleotide sequences, at least for purposes that do not require translation 
10 into protein (Wittung (1994) Nature 368, 561). 
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By "the junction of the fusion protein", we mean that the fragment of at least 10 
amino acids includes the amino acid accorded position 55 and 56 of SEQ ID NO:2 or SEQ ID 
NO:4. 

By "the flanking regions of the fusion specific exon" we mean that the fragment of at 
5 least 10 amino acids includes either the amino acids accorded position 383 and 384 or the 
amino acids accorded position 403 and 404 of SEQ ID NO:2 or SEQ ED NO:4 

The invention further features an isolated nucleic acid molecule capable of encoding 
the RORal-UNC5C polypeptide or a fragment thereof or the RORa5 polypeptide or a 
fragment thereof as defined above or a complement thereof, or a nucleic acid molecule having 
10 at least 65% preferably 75%, 85%, 90%, 95%, 98% or 99%, identity thereto. Preferred 
nucleotide sequences are SEQ ID NO:l, 3 or 7. 

The invention further features a vector comprising the nucleic acid molecule as 
defined above. 

Also included within the invention is the vector as defined above, wherein said vector 
15 is an expression vector. 

The invention further features a host cell comprising the expression vector as defined 

above. 

The invention further features a method of making the RORal-UNC5C polypeptide or 
the RORaS polypeptide as defined above comprising: incubating the host cell as defined 
20 hereinabove in a medium conducive to expression of the polypeptide and optionally purifying 
the polypeptide. 

The invention further features an antibody specific for the RORal-UNC5C 
polypeptide or the RORocS polypeptide as defined above. 

The invention further features a method for detecting the presence of the RORal- 
25 UNC5C polypeptide or the RORocS polypeptide as defined above in a biological sample 
comprising: 

contacting the sample with a binding agent capable of specifically binding to the 
polypeptide, and measuring the amount of binding. 

30 The invention further features a method of detecting the presence of the isolated 

nucleic acid molecule as defined above comprising: 

contacting said nucleic acid with a nucleic acid capable of specifically hybridizing to 
the translocation breakpoint. 
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Regions of specificity in the fusion protein/gene include, for example, the junction 
point and fusion specific insert. 

In yet another aspect, the invention features an isolated nucleic acid molecule 
comprising a nucleotide sequence having at least 65% identity to a degenerate variant of SEQ 
5 ID NO: 1, SEQ ID NO: 3 or SEQ ID NO: 7. The invention further features an isolated 
nucleic acid molecule comprising the complement of SEQ ID NO: 1, SEQID NO: 3 or SEQ 
ID NO: 7. 

The invention further features an isolated nucleic acid encoding a RORal-UNC5C 
polypeptide, the isolated nucleic acid molecule comprising a nucleotide sequence having at 
10 least 65% identity to a degenerate variant of SEQ ID NO: 1 or SEQ ID NO: 3. 

The invention further features an isolated nucleic acid encoding a RORocS polypeptide, 
the isolated nucleic acid molecule comprising a nucleotide sequence having at least 65% 
identity to a degenerate variant of SEQ ID NO: 7. 

The invention further features a vector comprising the nucleic acid molecule of any of 
is the above and a host cell comprising the vector. 

Also included within the invention is an isolated nucleic acid molecule encoding a 
protein having 65% sequence identity to the amino acid sequence of SED ID NO:2, SEQ ID 
NO: 4 or SEQ ID NO: 9. Moreover the invention features a purified polypeptide comprising 
an amino acid sequence having at least 65% identity to the amino acid sequence of SEQ ID 
20 NO: 2, SEQ ID NO: 4 or SEQ ID NO: 9. 

Also within the invention is a method for producing a protein comprising culturing the 
host cell having SEQ ID NO:l, SEQ ID NO: 3 or SEQ ID NO:7 under conditions whereby 
said protein is produced, and recovering the protein from the host cell culture. 

The invention further includes a method for detecting a polynucleotide which encodes 
25 a protein comprising the amino acid sequence of SEQ ID NO:2, SEQ ID NO: 4 or SEQ ID 
NO: 8 in a biological sample comprising the steps of: 

a) hybridizing the complement of the polynucleotide sequence which encodes SEQ ID 
NO: 2, SEQ ID NO: 4 or SEQ ID NO: 8 to a nucleic acid material of a biological sample, 
thereby forming a hybridization complex; and 
30 b) detecting the hybridization complex, wherein the presence of the complex correlates 

with the presence of a polynucleotide encoding a protein in the biological sample. 

One aspect of the invention is directed to a method for detecting the presence of other 
obesity susceptibility genes which may be responsible for the development of obesity by 
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determining the identity of genes in and around the resulting translocation junction t(4: 1 5)or 
t(15:4). The method includes determining the identity of genes in and around the 
translocation junction, e.g., located 20 cM, or less, adjacent to the junction and determining if 
the gene shows aberrant expression compared to the gene in a person not having the 
5 chromosomal translocation. 

As used herein, an "obesity susceptibility gene" or "obesity gene" is a gene which as a 
consequence of the translocation event between chromosome 4 and chromosome 15 has a 
predisposing influence in the development of obesity in a subject For example, an obesity 
susceptibility gene can refer to any or all of the following genes: UNC5C, sUNCSC, RORal, 
10 RORa5,lRORal-UNC5CandsRORal-UNC5C. 

As used herein "obesity protein" or "obesity susceptibility polypeptide" is the gene 
product of the obesity susceptibility gene or obesity gene. 

In yet another aspect, the invention features a method for detecting the chromosomal 
translocation comprising analysing a sample of DNA from an individual for the presence of 
is the translocation breakpoint, wherein the presence of the translocation breakpoint indicates 
that the person has a susceptibility to developing obesity. The nucleic acid sequence of the 
translocation junctions t(4:15) and t(15:4) is set out in SEQ ID NO: 11 and SEQ ID NO: 12 
respectively. The localisation of these sequences are shown in Figs. 4 and 5 respectively. 
The invention further includes a method for identifying a test compound that 
20 modulates the expression of an obesity susceptibility gene such as UNC5C, sl)NC5C, 

RORal, RORa5, IRORal-UNCSC arid sRORal-UNC5C identified in the methods above, 
comprising contacting a cell capable of expressing an obesity susceptibility gene with a test 
compound; and determining the level of expression of the obesity susceptibility gene in the 
presence of the test compound, wherein a decrease or an increase in the obesity susceptibility 
25 gene expression, as compared to the level of expression of the obesity susceptibility gene in 
the absence of the compound, is indicative that the test compound modulates the expression of 
the obesity susceptibility gene. 

Also within the invention is a method of identifying a test compound that modulates 
the activity of a protein encoded by an obesity susceptibility gene such as UNC5C, sUNC5C, 
30 RORal, RORocS, IRORal-UNCSC and sRORal-UNC5C, comprising contacting the protein 
with a test compound and determining the level of activity of the obesity protein in the 
presence of the compound, wherein a decrease or an increase in protein activity is indicative 
that the test compound modulates protein activity. 
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The invention further features a method of treating a subject having obesity 
comprising administering an effective amount of the compound identified above. The 
invention further includes a pharmaceutical composition comprising the compound identified 
as above, and a pharmaceutically acceptable adjuvant, diluent or carrier. 

The invention further includes a method of making a pharmaceutical composition. In 
one aspect, the method includes: contacting a cell capable of expressing an obesity 
susceptibility gene with a test compound; determining the level of expression of the obesity 
susceptibility gene identified above in the presence of the test compound, wherein a decrease 
or an increase in expression of the obesity susceptibility gene, as compared to the level of 
expression of the obesity susceptibility gene in the absence of the compound, is indicative that 
the test compound modulates the obesity susceptibility gene expression; and formulating the 
test compound that modulates the obesity susceptibility gene expression into a pharmaceutical 
composition. 

In another aspect, the invention includes making a pharmaceutical composition 
including contacting an obesity susceptibility protein with a test compound; determining the 
level of activity of the obesity susceptibility protein in the presence of the compound, wherein 
a decrease in the presence of the obesity susceptibility protein, as compared to the level of 
activity of the obesity susceptibility protein in the absence of the compound, is indicative that 
the test compound modulates obesity susceptility protein activity; and formulating the test 
compound that decreases obesity susceptibility protein activity into a pharmaceutical 
composition. 

In another aspect, the invention includes a method for determining if an obesity 
susceptibility gene identified above has an altered level of gene expression comprising 
comparing the level of obesity gene expression in a cell from a patient having obesity with a 
control cell, and determining the level of expression of the obesity susceptibility gene in both 
cells, wherein a decrease or an increase in expression of the obesity susceptibility gene, as 
compared to the level of expression of the obesity susceptibility gene in the control cell, 
indicates that the obesity susceptibility gene has altered gene expression. 

. The invention further includes methods of diagnosing obesity or a susceptibility 
i thereto in a subject. The method includes determining the level of an obesity susceptibility 
gene mRNA and/or the level of an obesity causing protein in a sample from a subject; and 
comparing the level of the mRNA and/or protein in the sample with a control, wherein a 
decrease or an increase in the level of the mRNA and/or protein in the sample compared to the 
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control indicates that the subject has obesity, or a susceptibility thereto. The invention also 
extends to products useful for carrying out the assay, such as DNA probes (labelled or 
unlabelled), kits and the like. 

The invention further includes methods of diagnosing obesity or a susceptibility 

5 thereto in a subject The method includes analysing for the presence of RORocl-UNC5C 
mRNA and/or protein in a sample from a subject; Wherein the presence of the RORal- 
UNC5C mRNA and/or protein indicates that the subject has obesity, or a susceptibility 
thereto. The invention also extends to products useful for carrying out the assay, such as 
DNA probes (labelled or unlabelled), kits and the like. 

10 The present invention also includes gene-based therapies directed at obesity. 

Therapies may be in the form of polynucleotides comprising all or a portion of the obesity 
susceptibility gene(s), or obesity gene locus, placed in appropriate vectors or delivered to 
target cells in direct ways which would modify the function of the obesity protein. 

General molecular biology techniques are described in "Current Protocols in Molecular 
15 Biology Volumesl-3, edited by F M Asubel, R Brent and R E Kingston; published by John 
Wiley, 1998 and Sambrook, J. and Russell, D.W., Molecular Cloning: A Laboratory Manual, 
the third edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 2001. 

Fig. 1 shows a pedigree of the family with schematically drawn chromosomes 
showing the inheritance of the balanced translocation where all affected individuals are 
20 extremely obese. BMI = body mass index. 

Fig. 2 shows the normal gene map of human chromosome 4 at the -translocation 
breakpoint 

Fig. 3 shows the normal map of chromosome 15 at the translocation breakpoint with 
the RORa locus. 

25 Fig. 4 shows the outcome of the translocation event at chromosome 4 (translocation 

chromosome 4: 1 5) with the resulting fusion transcripts. 

Fig. 5 shows the outcome of the translocation event at chromosome 15 (translocation 
chromosome 1 5 :4) with the destruction of RORal . 

Fig. 6 shows a schematic view of the protein domain architecture of the IRORal- 
30 UNC5C and sRORocl-UNC5C fusion protein transcripts together with that of normal human 
UNC5C. 
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TM 

Fig. 7 shows Taqman analysis of RORaS expression in various tissues, with the 

highest expression being found in brain. Fig. 7 shows RORaS expression levels where the X 
axis labelling indicates the following tissues; A = bone marrow, B = brain, C = heart, D = 
kidney, E = liver, F = lung, G = testis, H = skeletal muscle, I = small intestine, J = spleen and 
K = fat. 

Fig. 8 shows a schematic view of the IRORocl-UNCSC and sRORal-UNCSC fusion 
transcripts together with UNC5C and sUNCS'C. 

The invention is based on the finding that a genetic alteration in patients with obesity 
is the translocation of two specific chromosomes (t(4;15)). This event leads to the production 
of two fusion proteins that is thought to promote the development of obesity. It also destroys 
one copy of RORal (SEQ ID NOS: 5 & 6) thus maybe reducing the expression of this gene 
in certain tissues during development. It may also affect the expression of the other isoforms 
of RORa, in particular RORocS (SEQ ID NOS: 7 & 8) which is the closest situated isofonn of 
RORa on the other side of the breakpoint at chromosome 15. 

The breakpoint in chromosome 4 occurs at cytoband 4q22.3 and the breakpoint in 
chromosome 15 occurs at cytoband 15q22.2. The balanced translocation results in patients 
having one normal chromosome 4 and one normal chromosome 15 together with the 
reciprocal translocation chromosomes. The outcome of the translocation event is that the 
expression and\or activity of at least one or more genes is affected. 

The breakpoint at chromosome 4 is between the genes Pyruvate dehydrogense A2 
(PDHA2) and UNC5C (SEQ ID NOS: 9 & 10). While neither of these genes are destroyed in 
the mutation event, UNC5C was found to be upregulated in adipocytes from obese affected 
individuals, as compared with control adipocytes. 

UNC5 is required for correct projections of pioneer axons and for migrating cells 
along the body wall in C. elegans (Leung Hagensteijn et. al., Cell 1992 Oct 71 (2): 289-99). 
UNC5 genetically interacts with UNC6 and UNC40. Mutants of all three genes defects in the 
axon projections. UNC5 and UNC40 are transmembrane receptors expressed on developing 
neurons. Both are receptors for the diffusible protein UNC6/Netrin (Hedgecock et al.; Neuron 
1990 Jan ;4(1): 61-85) (Wadsworth; Trends Neurosci 2002 Aug ;25(8): 423-9) Vertebrates 
have at least 3 UNC5 like genes named UNC5A, UNC5B and UNC5C (Leonardo et. al.; 
Nature 1997 Apr ;386 (6627): 833-8) and the protein contains two immunoglobulin domains, 
two trombospondin domains on the extracellular part and intracellular domains ZU5, DB and 
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a Death Domain (Leung Hagensteijn etaL, Cell 1992 Oct 71 (2): 289-99) (Keleman & 
Dickson; Neuron 2001 Nov ;32(4): 605-17). Human UNC5C is localised on chromosome 4 
at cytoband q22.3 (Ackerman & Knowles; Genomics 1998 Sep ;52(2): 205-8). 

The breakpoint at chromosome 1 5 is within the first intron of RORocl . This isoform of 

5 RORa is destroyed by the mutation event. Moreover, it was found that the translocation 
resulted in a novel transcript, which is a fusion between RORal exon 1 with UNC5C exon 2 
producing 2 novel fusion proteins through differential splicing within the UNC5C gene (SEQ 
ID NO:2 and SEQ ID NO: 4). 

RORal is an orphan nuclear hormone receptor localized on chromosome 15 q22.3 (V. 

10 Giguere et. al Genomics 1995 Aug ;28(3): 596-8). Nuclear hormone receptors (NHR) 

constitute a super family of highly conserved intracellular receptors found in nematodes, flies 
and vertebrates. The protein architecture is a N-terminal transactivation domain followed by a 
DN A binding domain, hinge region and a c-tenninal ligand binding domain. Many NHRs 
form dimers upon activation and translocate into the nucleus where they bind DNA to activate 

15 or; repress target gene expression. There are four publicly available isoforms of RORal, 
namely RORal -4. 

We have identified yet another isoform of RORa, designated RORcx5 (SEQ ID 7 & 8). 
RORaS was isolated with 5 'RACE (rapid amplification of C-terminal ends) performed on 
public available adipocyte cDNA library (clontech). The expression pattern of RORa5 is 

20 similar to RORal, and is most abundantly expressed in the brain (Fig. 7). 

Accordingly, the present invention describes the chromosomal location of obesity 
susceptibility genes and provides methods for detecting the presence of such genes. The 
present invention enables the development of novel therapies for obesity by screening for 
compounds and other entities, such as antibodies, which modulate the activity of the proteins 

25 encoded by the associated genes, e.g., UNC5C, sUNC5C, RORal, RORa5, lRORal-UNC5C 
and sRORal-UNC5C. Knowledge of the sequence of the obesity susceptibility genes also 
enables the development of novel antigene methods to modulate the expression of the obesity 
susceptibility gene and also enables the development of novel gene therapy techniques to treat 
obesity. The discovery of obesity susceptibility genes may also assist in developing novel 

30 methods for diagnosing obesity via (i) measuring the levels of the translated mRNA present in 
affected tissue and (ii) measuring the levels of the protein in affected tissue. It is possible that 
the diagnosis of obesity, or the susceptibility of an individual to obesity, by these methods . 
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may be achieved in patients who do not yet display the classical symptoms of the disease. 
Such determination of susceptibility to obesity or the early detection of disease development 
may lead to earlier clinical intervention, than is currently possible, and may lead to more 
effective treatment of the disease. 
Further identification of obesity susceptibility genes 

The present findings describes a chromosomal translocation which alters the 
expression\activity of genes in and around the translocation junction causing obesity. The 
genetic location of such genes are at the breakpoint of a translocation between chromosome 
4q22.3 and chromosome 15q22.2. Thus, the present invention includes methods of 
identifying genes around the translocation breakpoint and determining if any gene around the 
translocation breakpoint has abherrant expression. The invention includes identifying genes, 
e!g., 8, 7, 6, 5, 4, 3, 2 or 1 cM around the translocation breakpoint and comparing the 
expression profile of these genes against the control genes. This can be done by using 
techniques known in the art to further analyze and delineate this genetic region. For example, 
using adipocytes isolated from obese individuals carrying the translocation and control 
adipocytes it is possible to compare the expression profile of an identified obese susceptibility 
gene identified in the region, or the activity of the gene product of the gene and determine 
abherrant expression or activity. 

Techniques such as Northern analysis, in situ hybridization or expression profiling can 
be used to further verify the association of these genes with obesity. In one example, a 
reporter-based assay maybe devised to detect whether an identified obesity susceptibility 
gene has a different transcription levels and/or message stability compared to the same gene 
in a person not susceptibile to obesity. Individuals who carry the obesity susceptibility gene 
may exhibit differences in their ability to regulate protein biosynthesis under different 
physiological conditions and may display altered physiological abilities. 

In another example, the level of obesity gene expression can be assayed by detecting 
and measuring obesity transcription. For example, RNA from adipocytes from a person 
carrying the translocation can be isolated and tested utilizing hybridization or PCR 
techniques. Such analyses can reveal both quantitative and qualitative aspects of the 
expression pattern of the obesity gene, including activation or inactivation of obesity gene 
expression. 
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Pathway mapping 

Also within the invention is the identification and elucidation of the obesity 
biochemicaMgnal transduction pathway of which the obesity susceptibility protein is a 
component. In particular, the invention includes identifying the other components of the 
obesity susceptibility protein biochemical pathway. In this way it is possible to identify the 
specific critical signaling pathway which links the disease stimulus to the cell's response and 
enables the identification of new potential targets for therapy intervention. 

As used herein, a obesity susceptibility protein "target molecule" is a molecule in the 
obesity biochemical pathway with which the obesity susceptibility protein binds or interacts, 
directly or indirectly, with, or is a molecule that regulates the expression of the obesity gene 
or translation of the protein in nature. For example, the target molecule can be a protein 
which directly interacts with the obesity susceptibility protein, or can be a protein which does 
not itself directly interact with the obesity susceptibility protein, but which is a component of 
the obesity susceptibility protein biochemical pathway. In one example, the target molecule 
can be an intercellular protein or a protein which facilitates the association of downstream or 
upstream signaling molecules with the obesity susceptibility protein. 

According to a further aspect of the invention there is provided the use of the obesity 
susceptibility protein in research to identify further gene targets implicated in obesity. 
Methods for identifying proteins which interact with the obesity susceptibility protein are 
known in the art, e.g., the two-hybrid assay (see, e.g., U.S. Pat. No. 5,283,317; Zervos et aL 
Cell 72:223-232, 1993) or using cell culture techniques to identify binding partners. 

The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes 
two different DNA constructs. In one construct, the gene that codes for an obesity 
susceptibility protein is fused to a gene encoding the DNA binding domain of a known 
transcription factor (e.g., GAL-4). In the other construct, a DNA sequence, from a library of 
DNA sequences, that encodes an unidentified protein ("prey" or "sample") is fused to a gene 
that codes for the activation domain of the known transcription factor. If the "bait" and the 
"prey" proteins are able to interact, in vivo, forming an obesity susceptibility protein - 
30 dependent complex, the DNA-binding and activation domains of the transcription factor are 
brought into close proximity. This proximity allows transcription of a reporter gene (e.g., 
LacZ) which is operably linked to a transcriptional regulatory site responsive to the 
transcription factor. Expression of the reporter gene can be detected and cell colonies 
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containing the functional transcription factor can be isolated and used to obtain the cloned 
gene which encodes the "target" protein which interacts with the obesity susceptibility 
protein. 

Alternatively, binding partners for the obesity susceptibility protein can be identified 
using cell culture techniques or using cells obtained directly from an obesity patient The 
method includes isolating the obesity susceptibility protein of interest from the cell and 
determining the identity of its target molecule. Initial screening can be accomplished by 
Western blot analysis to analyse immunochemical^, using antibodies against the obesity 
susceptibility protein, the size of the obesity susceptibility protein ^target molecule complex. 
Further analysis of the complex will reveal the identity of the target molecule. 

The gene and protein encoded by the target molecule is also a potential target for 
therapeutic intervention in obesity disease, for instance in the development of antisense 
nucleic acid targeted to the mKNA; or more widely in the identification or development of 
chemical or hormonal therapeutic agents. The person skilled in the art is also capable of 
devising screening assays to identify compounds (chemical or biological) that modulate 
(activate or inhibit) the identified gene, which compounds may prove useful as therapeutic 
agents in treating or preventing obesity. 

RORal-UNCSC andRORaS nucleic acid sequence and polypeptide 

The invention encompasses a RORal-UNC5C nucleic acid sequence having at least 
65%, 70%, 75%, 80%, 85%, 90% or 99%, sequence identity to the RORocl-UNC5C nucleic 
acid sequence of SEQ ID NO: 1 or SEQ ED NO: 3. The invention also encompasses a 
RORocS nucleic acid sequence having at least 65%, 70%, 75%, 80%, 85%, 90% or 99%, 
sequence identity to the RORoc5 nucleic acid sequence of SEQ ID NO: 7. The comparison of 
sequences and determination of percent sequence identity between two sequences can be 
accomplished using a mathematical algorithm. In a preferred embodiment, the percent 
identity between two amino acid sequences is determined using the Needleman and Wunsch 
(J. Mol. Biol. (48):444-453 (1970)) algorithm which has been incorporated into the GAP 
program in the GCG software package (available at http://www.gcg.com), using either a 
Blosum 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a 
length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent identity 
between two nucleotide sequences is determined using the GAP program in the GCG software 
package (available at ttp://www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight 
of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another embodiment, the 
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percent identity between two amino acid or nucleotide sequences is determined using the 

algorithm of E. Meyers and W. Miller (CABIOS, 4:1 1-17 (1989)) which has been 

incorporated into the ALIGN program (version 2.0), using a PAM120 weight residue table, a 

gap length penalty of 12 and a gap penalty of 4. 

The invention also encompasses polynucleotides which encode the RORal-UNC5C 

protein of SEQ ID NO: 2 or SEQ ID NO: 4, or the RORocS protein of SEQ ID NO: 8. It will 

be appreciated by those skilled in the art that as a result of the degeneracy of the genetic code, 

a multitude of nucleotide sequences encoding RORal-UNC5C polypeptide or RORa5 

polypeptide, some bearing minimal homology to the nucleotide sequences of any known and 

naturally occurring gene, may be produced. Thus, the invention contemplates each and every 

possible variation of nucleotide sequence that could be made by selecting combinations based 

on possible codon choices. These combinations are made in accordance with the standard 

triplet genetic code as applied to the nucleotide sequence of naturally occurring RORal- 

UNC5C polypeptide or RORocS polypeptide, and all such variations are to be considered as 

being specifically disclosed. 

The invention also encompasses production of DNA sequences, or fragments thereof, 

which encode lRORod-UNC5C, sRORal-UNCSC orRORaS polypeptide and their 

derivatives, entirely by synthetic chemistry. After production, the synthetic sequence may be 

inserted into any of the many available expression vectors and cell systems using reagents that 

are well known in the art. 

Also encompassed by the invention are polynucleotide sequences that are capable of 

hybridizing to the claimed RORocl-UNCSC or RORaS nucleic acid seqeunce, and in 

particular, those shown in SEQ ID NO: 1, SEQ ID NO: 3 or SEQ ID NO: 7, under various 

conditions of stringency as taught in Wahl, G. M. and S. L. Berger (1987; Methods Enzymol. 

152:399-407) and Kimmel, A. R. (1987; Methods Enzymol. 152:507-51 1). Appropriate 

stringency conditions which promote DNA hybridization, for example, 6.0 X sodium 

chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 2.0 X SSC at 50°C, are 

known to those skilled in the art or can be found in Current Protocols in Molecular Biology, 

John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For example, the salt concentration in the wash 

step can be selected from a low stringency of about 2.0 SSC at 50°C to a high stringency of 

about 0.2 X SSC at 50°C. In addition, ihe temperature in the wash step can be increased from 

low stringency conditions at room temperature, about 22° C, to high stringency conditions at 
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about 65°C. Moderately stringent conditions are, for example at about 2.0 X SSC and about 
40°C 

Also included in the invention are RORal -UNC5C polypeptides having at least 65% 
amino acid sequence identity to the RORal -UNC5C polypeptide of SEQ ID NO: 2 or SEQ 
5 ID NO: 4 and which retains at least one biological or other functional characteristic or activity 
of the RORal -UNC5C polypeptide. A most preferred RORal -UNC5C variant is one having 
at least 65% 70%, 80% or 90% amino acid sequence identity to SEQ ID NO: 2or SEQ ID NO: 
4. 

Also included in the invention is RORaS polypeptide having at least 65% amino acid 

10 sequence identity to the RORa5 polypeptide of SEQ ID NO: 8 and which retains at least one 
biological or other functional characteristic or activity of the RORoc5 polypeptide. A most 
preferred RORaS variant is one having at least 65% 70%, 80% or 90% amino acid sequence 
identity to SEQ ID NO: 8. 

The invention also includes variants of the RORal-UNC5C polypeptide or the 

15 RORocS polypeptide which can contain one or more substitutions of amino acid residues 
which result in a silent change and a functionally equivalent RORal -UNC5C polypeptide or 
RORctS polypeptide respectively. Deliberate amino acid substitutions may be made on the 
basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the 
amphipathic nature of the residues as long as the biological or immunological activity of 

20 RORal -UNC5C polypetide or RORa5 polypeptide is retained. For example, negatively 
charged amino acids may include aspartic acid and glutamic acid; positively charged amino 
acids may include lysine and arginine; and amino acids with uncharged polar head groups 
having similar hydrophilicity values may include leucine, isoleucine, and valine, glycine and 
alanine, asparagine and glutamine, serine and threonine, and phenylalanine and tyrosine. 

25 Obesity Proteins and Polypeptides 

Obesity proteins such as the RORal, RORaS, UNC5C, sUNC5C or the RORal - 
UNC5C fusion proteins can be prepared for a variety of uses, including but not limited to the 
generation of antibodies, as reagents in diagnostic assays, in the identification of other cellular 
gene products involved in the pathogenesis of obesity, as reagents in assays for screening for 

30 compounds that can be used in the treatment of obesity, and as pharmaceutical reagents useful 
in the treatment of obesity. 

A variety of host-expression vector systems may be utilized to express the obesity 
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nucleotide sequences of the invention. The expression systems that may be used for 
purposes of the invention include but are not limited to microorganisms such as bacteria (e.g., 
E. coli, B. subtilis) transformed with recombinant bacteriophage DNA, plasmid DNA or 
cosmid DNA expression vectors containing obesity nucleotide sequences or mammalian cell 
5 systems (e.g., COS, CHO, BHK, 293, 3T3) harbouring recombinant expression constructs 
containing promoters derived from the genome of mammalian cells (e.g., metallothionein 
promoter) or from mammalian viruses (e.g., the adenovirus late promoter; the vaccinia virus 
7.5K promoter). 

In bacterial systems, a number of expression vectors may be advantageously selected 
id depending upon the use intended for the obesity gene product being expressed. For example, 
when a large quantity of such a protein is to be produced, for the generation of pharmaceutical 
compositions of obesity protein or for raising antibodies to the obesity protein, for example, 
vectors which direct the expression of high levels of fusion protein products that are readily 
purified may be desirable. Such vectors include, but are not limited to, the E. coli expression 
15 vector pUR278 (Ruther et al: y EMBO J. 2:1791, 1983), in which the obesity coding sequence 
may be ligated individually into the vector in frame with the lacZ coding region so that a 
fusion protein is produced; pIN vectors (Inouye & Inouye, Nucleic Acids Res. 13:3101-3109, 
1985; Van Heeke & Schuster, J. Biol. Chem. 264:5503-5509, 1989); and the like. PGEX 
vectors may also be used to express foreign polypeptides as fusion proteins with glutathione 
20 S-transferase (GST). In general, such fusion proteins are soluble and can easily be purified 
from lysed cells by adsorption to glutathione-agarose beads followed by elution in the 
presence of free glutathione. The PGEX vectors are designed to include thrombin or factor Xa 
protease cleavage sites so that the cloned target gene product can be released from the GST 
moiety. 

25 In mammalian host cells, a number of viral-based expression systems may be utilized. 

In cases where an adenovirus is used as an expression vector, the obesity nucleotide sequence 
of interest may be ligated to an adenovirus transcription/translation control complex, e.g., the 
late promoter and tripartite leader sequence. This chimeric gene may then be inserted in the 
adenovirus genome by in vitro or in vivo recombination. Insertion in a non-essential region of 

30 the viral genome (e.g., region El or E3) will result in a recombinant virus that is viable and 
capable of expressing the obesity gene product in infected hosts. (E.g., See Logan & Shenk, 
Proc. Natl. Acad. Sci. USA 81:3655-3659,1984). 

For long-term, high-yield production of recombinant proteins, stable expression is 
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preferred. For example, cell lines which stably express the obesity sequences described 
above may be engineered. Rather than using expression vectors which contain viral origins of 
replication, host cells can be transformed with DNA controlled by appropriate expression 
control elements (e.g., promoter, enhancer sequences, transcription terminators, 
polyadenylation sites, etc.), and a selectable marker. Following the introduction of the foreign 
DNA, engineered cells may be allowed to grow for 1-2 days in an enriched media, and then 
are switched to a selective media. The selectable marker in the recombinant plasmid confers 
resistance to the selection and allows cells to stably integrate the plasmid into their 
chromosomes and grow to form foci which in turn can be cloned and expanded into cell lines. 
This method may advantageously be used to engineer cell lines which express the obesity 
gene product. Such engineered cell lines may be particularly useful in screening and 
evaluation of compounds that affect the endogenous activity of the obesity gene product 

Alternatively, any fusion protein may be readily purified by utilizing an antibody 
specific for the fusion protein being expressed. For example, a system described by Janknecht 
et al. allows for the ready purification of non-denatured fusion proteins expressed in human 
cell lines (Janknecht, et al. 9 Proc. Natl. Acad. Sci. USA 88:8972-8976, 1991). In this system, 
the gene of interest is subcloned into a vaccinia recombination plasmid such that the gene ! s 
open reading frame is translationally fused to an amino-terminal tag consisting of six histidine 
residues. Extracts from cells infected with recombinant vaccinia virus are loaded onto NL2+. 
nitfiloacetic acid-agarose columns and histidine-tagged proteins are selectively eluted with 
imidazole-containing buffers. 

The obesity gene products can also be expressed in transgenic animals. Animals of any 
species, including, but not limited to, mice, rats, rabbits, guinea pigs, pigs, micro-pigs, goats, 
and non-human primates, for example, baboons, monkeys, and chimpanzees may be used to 
generate obesity transgenic animals. 

Any technique known in the art may be used to introduce the obesity transgene into 
animals to produce the founder lines of transgenic animals. Such techniques include, but are 
not limited to pronuclear microinjection (U.S. Pat No. 4,873,191); retrovirus mediated gene 
transfer into germ lines (Van der Putten et ai 9 Proc. Natl. Acad. Sci. USA 82:6148-6152, 
1985); gene targeting in embryonic stem cells (Thompson et aL, Cell 56:313-321, 1989); 
electroporation of embryos (Lo, Mol. Cell. Biol. 3 :1803-1814, 1983); and sperm-mediated 
gene transfer (Lavitrano et ai, Cell 57:717-723, 1989); etc. For a review of such techniques, 
see Gordon, Transgenic Animals, Intl. Rev. Cytol. 1 15:171-229, 1989, which is incorporated 
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by reference herein in its entirety. 

The present invention provides for transgenic animals that carry the obesity transgene 
in all their cells, as well as animals which carry the transgene in some, but not all their cells, 
i.e., mosaic animals. The transgene may be integrated as a single transgene or in concatamers, 
e.g., head-to-head tandems or head-to-tail tandems. The transgene may also be selectively 
introduced into and activated in a particular cell type by following, for example, the teaching 
of Lasko et al (Lasko, M. et al. 9 Proc. Natl. Acad. Sci. USA 89:6232-6236, 1992). Once 
transgenic animals have been generated, the expression of the recombinant obesity gene may 
be assayed utilizing standard techniques. Initial screening may be accomplished by Southern 
blot analysis or PCR techniques to analyze animal tissues to assay whether integration of the 
transgene has taken place. The level of mRNA expression of the transgene in the tissues of the 
transgenic animals may also be assessed using techniques which include but are not limited to 
Northern blot analysis of tissue samples obtained from the animal, in situ hybridization 
analysis, and RT-PCR. Samples of obesity gene^expressing tissue, may also be evaluated 
immunocytochemically using antibodies specific for the obesity transgene product. 
Antibodies to obesity Proteiits 

Antibodies that specifically recognize one or more epitopes of obesity proteins, or 
peptide fragments, are also encompassed by the invention. Such antibodies include but are not 
limited to polyclonal antibodies, monoclonal antibodies (mAbs), humanized or chimeric 
antibodies, single chain antibodies, Fab fragments, F(ab').2 fragments, fragments produced by 
a Fab expression library, anti-idiotypic (anti-Id) antibodies, and epitppe-binding fragments of 
any.of the above. 

The antibodies of the invention may be used, for example, in the detection of the 
obesity in a biological sample and may, therefore, be utilized as part of a diagnostic or 
prognostic technique whereby patients may be tested for abnormal amounts of obesity. Such 
antibodies may also be utilized in conjunction with, for example, compound screening 
schemes for the evaluation of the effect of test compounds on expression and/or activity of the 
obesity gene product Additionally, such antibodies can be used in conjunction with the gene 
therapy techniques described, for example, evaluate the normal and/or engineered obesity - 
expressing cells prior to their introduction into the patient Such antibodies may additionally 
be used as a method for the inhibition of abnormal obesity activity. Thus, such antibodies 
may, therefore, be utilized as part of obesity treatment methods. 

Methods of making and detecting labelled antibodies are well known (Campbell; 
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Monoclonal Antibody Technology, in: Laboratory Techniques in Biochemistry and 
Molecular Biology, Volume 13. Eds: Burdon R et al Elsevier, Amsterdam (1984)). The term 
antibody includes both monoclonal antibodies, which are a substantially homogeneous 
population, and polyclonal antibodies which are heterogeneous populations. The term also 
includes inter alia, humanised and chimeric antibodies. Monoclonal antibodies to specific 
antigens may be obtained by methods known to those skilled in the art, such as from 
hybridoma cells, phage display libraries or other methods. Monoclonal antibodies may be 
inter alia, human, rat or mouse derived. For the production of human monoclonal antibodies, 
hybridoma cells may be prepared by fusing spleen cells from an immunised animal, e.g. a 
mouse, with a tumour cell. Appropriately secreting hybridoma cells may thereafter be 
selected (Koehler & Milstein, Nature 256:495-497 (1975); Cole et al, "Monoclonal 
antibodies and Cancer Therapy", Alan R Liss Inc, New York N.Y. pp 77-96 (1985)). Such 
antibodies may be of any immunoglobulin class including IgG, IgM, IgE, IgA, IgD and any 
subclass thereof. 

Polyclonal antibodies can be generated by immunisation of an animal (such as a 
mouse, rat, goat, horse, sheep etc) with an antigen, such as one of the obesity proteins used in 
this invention. 

Rodent antibodies may be humanised using recombinant E)NA technology according 
to techniques known in the art Alternatively, chimeric antibodies, single chain antibodies, 
Fab fragments may also be developed against the polypeptides of the invention (Huse et al. 9 
Science 256:1275-1281 (1989)), using skills known in the art Antibodies so produced have a 
number of uses which will be evident to the molecular biologist or immunologist skilled in the 
art Such uses include, but are not limited to, monitoring enzyme expression, development of 
assays to measure enzyme activity and use as a therapeutic agent Enzyme linked 
immunosorbant assays (ELISAs) are well known in the art and would be particularly suitable 
for detecting the obesity protein or polypeptide fragments thereof in a test sample. 

Screening Assay 

The invention also provides a method for identifying modulators, i.e., test compounds 
(e.g., peptides, peptidomimetics, small molecules or other drugs) which modulate the activity 
of an obesity susceptibility gene. 

In one example, the invention provides methods for screening for test compounds for 
use in the treatment of obesity by screening for test compounds that modulate the activity of 
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the obesity protein, or a portion thereof. The test compounds of the present invention can be 
obtained using any of the numerous approaches in combinatorial library methods known in 
the art, including: biological libraries. The method can be a cell-based method or a cell free 
method The screening methods according to the invention may be operated using 
conventional procedures, for example by bringing the test compound or compounds to be 
screened and an appropriate substrate into contact with the obesity polypeptide, or a cell 
capable of producing it, or a cell membrane preparation thereof, and determining affinity for 
the obesity polypeptide in accordance with standard techniques. 

Any compound identified in this way may prove useful in the treatment of obesity in 
humans and/or other animals. The invention thus extends to a compound selected through its 
ability to regulate the activity of the obesity protein in vivo as primarily determined in a 
screening assay utilising an obesity polypeptide or a homologue or fragment thereof, or a gene 
coding therefore for use in the treatment of a disease in which the over- or under-activity or 
unregulated activity of the protein is implicated. 

According to a further aspect of the invention there is provided a screening assay or 
method for identifying potential anti- obesity therapeutic compounds comprising contacting 
an assay system capable of detecting the effect of a test compound against expression level of 
obesity, with a test compound and assessing the change in expression level of obesity. 
Compounds that modulate the expression of DNA or RNA of obesity polypeptides may be 
detected by a variety of assay systems. A suitable assay system may be a simple "yes/no" 
assay to determine whether there is a change in expression of a reporter gene, such as beta- 
galactosidase, luciferase, green fluorescent protein or others known to the person skilled in the 
art (reviewed by Naylor, Biochem. Pharmacol. 58:749-57,1999). The assay system may be 
made quantitative by comparing the expression or function of a test sample with the levels of 
expression or function in a standard sample. Systems in which transcription factors are used 
to stimulate a positive output, such as transcription of a reporter gene, are generally referred to 
as "one-hybrid systems" (Wang, M.M. and Reed, R.R. Nature 364:121-126, 1993). Using a 
transcription factor to stimulate a negative output (growth inhibition) may thus be referred to 
as a "reverse one-hybrid system" (Vidal et al, supra). Therefore, in an embodiment of the 
present invention, a reporter gene is placed under the control of the obesity promoter. 

In a further aspect of the invention we provide a cell or cell line comprising a reporter 
gene under the control of the obesity promoter. 
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According to another aspect of the present invention there is provided a method of 
screening for a compound potentially useful for treatment of obesity which comprises 
assaying the compound for its ability to modulate the activity or amount of obesity. 
Preferably the assay is selected from: 

i) measurement of obesity activity using a cell line which expresses the obesity 
polypeptide or using purified obesity polypeptide; and 

ii) measurement of obesity transcription or translation in a cell line expressing the 
obesity polypeptide. 

Thus, in a further aspect of the invention, cell cultures expressing the obesity polypeptide 
can be used in a screen for therapeutic agents. Effects of test compounds may be assayed by 
changes in mRNA or protein of obesity. As described above, cells (i.e. mammalian, bacterial 
etc) can be engineered to express the obesity polypeptide. 

Thus, according to a further aspect of the invention there is provided a method of testing 
potential therapeutic agents for the ability to suppress the obesity phenotype comprising 
contacting a test compound with a cell engineered to express the obesity polypeptide; and 
determining whether said test compound suppressed expression of the obesity polypeptide. 

We also provide a method for identifying inhibitors of transcription of obesity, which 
method comprises contacting a potential therapeutic agent with a cell or cell line as described 
above and determining inhibition of obesity transcription by the potential therapeutic agent by 
reference to a lack of or reduction in expression of the reporter gene. 

Any convenient test compound or library of test compounds may be used in conjunction 
with the test assay. Particular test compounds include low molecular weight chemical 
compounds (preferably with a molecular weight less than 1500 daltons) suitable as 
pharmaceutical or veterinary agents for human or animal use, or compounds for non- 
administered use such as cleaning/sterilising agents or for agricultural use. Test compounds 
may also be biological in nature, such as antibodies. 

According to a further aspect of the invention there is provided a compound identified by 
a screening method as defined herein. 

According to another aspect of the present invention there is provided use of a compound 
able to modulate the activity or amount of obesity in the preparation of a medicament for the 
treatment of obesity. Modulation of the amount of obesity by a compound may be brought 
about for example through altered gene expression level or message stability. Modulation of 
the activity of obesity by a compound may also be brought about for example through 
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compound binding to the obesity protein. In one embodiment, modulation of obesity 
comprises use of a compound able to reduce the activity or amount of obesity. In another 
embodiment, modulation of obesity comprises use of a compound able to increase the activity 
or amount of obesity. 
Diagnostic test 

To determine if an individual is susceptible to obesity determination of the presence of 
the translocation junction between chromosome 4q22.3 and chromosome 15q22.2 can be 
performed to predict whether the individual has a high risk obesity chromosome. Moreover, 
determination of the translocation breakpoint can be used to identify the underlying cause of 
obesity in a person who is obese. 

A diagnostic test method lends itself readily to the formulation of kits which can be 
utilized in diagnosis. The diagnostic kits may comprise appropriate packaging and 
instructions for use in the methods of the invention. Such kits may further comprise 
appropriate buffer(s) and polymerase(s) such as thermostable polymerases, for example taq 
polymerase. 

Methods for the diagnosis of obesity may, for example, utilize reagents such as the 
nucleotide sequences which span the breakpoint of sRORal -UNC5C or IRORal -UNC5C 
antibodies. Specifically, such reagents may be used, for example, for: (1) the detection of the 
presence of short or long RORocl -UNC5C transcript, (2) the detection of perturbations or 
abnormalities in the signal transduction pathway mediated by RORocl -UNC5C or UNC5C. 

For the detection of mutations, any nucleated cell can be used as a starting source for 
genomic nucleic acid. For the detection of an obesity susceptibility gene such as sRORal - 
UNC5C or IRORal -UNC5C gene products, any cell type or tissue in which the obesity gene 
is expressed can be used. 
Detection of the obesity Gene and Transcripts 

Obesity susceptibility gene transcripts can be detected by utilizing a number of 
techniques. Nucleic acid from any nucleated cell can be used as the starting point for such 
assay techniques, and may be isolated according to standard nucleic acid preparation 
procedures which are well known to those of skill in the art. 
i DNA may be used in hybridization or amplification assays of biological samples to detect 
abnormalities involving obesity gene structure or gene regulatory elements, including point 
mutations, insertions, deletions and chromosomal rearrangements. Such assays may include, 
but are not limited to, Southern analyses, single stranded conformational polymorphism 
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analyses (SSCP), and PCR analyses. 

Such diagnostic methods for the detection of obesity gene-specific mutations can involve 
for example, contacting and incubating nucleic acids including recombinant DNA molecules, 
cloned genes or degenerate variants thereof, obtained from a sample, for example, derived 
from a patient sample or other appropriate cellular source, with one or more labelled nucleic 
acid reagents including recombinant DNA molecules, under conditions favourable for the 
specific annealing of these reagents to their complementary sequences within the obesity, 
gene. Preferably, the lengths of these nucleic acid reagents are at least 15 to 30 nucleotides. 
After incubation, all non-annealed nucleic acids are removed from the nucleic acid: obesity 
molecule hybrid The presence of nucleic acids which have hybridized, if any such molecules 
exist, is then detected. Using such a detection scheme, the nucleic acid from the cell type or 
tissue of interest can be immobilized, for example, to a solid support such as a membrane, or a 
plastic surface such as that on a microtiter plate or polystyrene beads. Detection of the 
remaining, annealed, labelled obesity nucleic acid reagents is accomplished using standard 
techniques well-known to those in the art. The obesity gene sequences to which the nucleic 
acid reagents have annealed can be compared to the annealing pattern expected from a normal 
obesity gene sequence in order to determine whether an obesity gene mutation is present 
Detection of the obesity Gene Products 

Antibodies directed against wild type or mutant obesity gene products or conserved 
variants or peptide fragments thereof, can also be used as obesity diagnostics. Such 
diagnostic methods, can be used to detect abnormalities in the level of obesity gene 
expression, or abnormalities in the structure and/or temporal, tissue, cellular, or subcellular 
location of the obesity, and may be performed in vivo or in vitro 9 such as, for example, on 
biopsy tissue. 

. For example, antibodies directed to epitopes of the obesity can be used in vivo to detect 
the pattern and level of expression of the obesity in the body. Such antibodies can be labelled, 
and injected into a subject in order to visualize binding to the obesity expressed in the body 
using methods such as X-rays, CAT-scans, or MRI. 

Additionally, any obesity fusion protein or obesity conjugated protein whose presence 
can be detected, can be administered. For example, obesity fusion or conjugated proteins 
labelled with a radio-opaque or other appropriate compound can be administered and 
visualized in. vivo, as discussed, above for labelled antibodies. 
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Alternatively, immunoassays or fusion protein detection assays, as described above, can 
be utilized on biopsy and autopsy samples in vitro to permit assessment of the expression 
pattern of the obesity. Such assays are not confined to the use of antibodies that define the 
obesity, but can include the use of antibodies directed to epitopes of any region of the obesity. 

The tissue or cell type to be analyzed will generally include those which are known, or 
suspected, to express the obesity gene. The protein isolation methods employed herein can, 
for example, be such as those described in Harlow and Lane (Harlow, E. and Lane, D., 1988, 
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y.), which is incorporated herein by reference in its entirety. The isolated cells can 
be derived from cell culture or from a patient. The analysis of cells taken from culture may be 
a necessary step in the assessment of cells that could be used as part of a cell-based gene 
therapy technique or, alternatively, to test the effect of compounds on the expression of the 
obesity gene. 
Gene therapy 

The invention encompasses gene therapy methods and compositions for treating and 
preventing obesity. 

In one example, the loss of normal obesity gene product function can result in the 
development of obesity. Increasing obesity gene product activity, or activation of the obesity 
pathway (e.g., downstream activation) would therefore facilitate progress in individuals 
exhibiting a deficient level of obesity gene expression and/or obesity activity. 

Alternatively, obesity may be ameliorated by decreasing the level of obesity gene 
expression, and/or obesity gene activity, and/or downregulating activity of the obesity 
pathway (e.g., by targeting downstream signalling events). Different approaches are discussed 
below. 

Inhibition of obesity Expression or obesity Activity 

Any method that neutralizes or inhibits expression of the obesity gene (either 
transcription or translation), e.g. RORocl, RORoc5 and RORal-UNC5C, can be used to 
prevent or treat obesity. 

For example, the administration of soluble peptides, proteins, fusion proteins, or 
antibodies (including anti-idiotypic antibodies) that bind to and "neutralize" circulating 
obesity can be used. Such obesity neutralizing peptides, proteins, fusion proteins, anti- 
idiotypic antibodies or Fabs are administered to a subject in amounts sufficient to treat or 
prevent obesity. 
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In an alternate embodiment, therapy can be designed to reduce the level of endogenous 
obesity gene expression, for example, using antisense or ribozyme approaches to inhibit or 
prevent translation of obesity mRNA-transcripts; triple helix approaches to inhibit 
transcription of the obesity gene; or targeted homologous recombination to inactivate or 
"knock out" the obesity gene or its endogenous promoter. Antisense, ribozyme or DNA 
constructs can be administered directly to the site containing the target cells; or can be 
directed to the target cells. 

Antisense approaches involve the design of oligonucleotides (either DNA or RNA) that 
are complementary to obesity mRNA. The antisense oligonucleotides will bind to the 
complementary obesity mRNA transcripts and prevent translation. Absolute complementarity, 
although preferred, is not required. A sequence "complementary" to a portion of an RNA, as 
referred to herein, means a sequence having sufficient complementarity to be able to 
hybridize with the RNA, forming a stable duplex; in the case of double-stranded antisense 
nucleic acids, a single strand of the duplex DNA may thus be tested, or triplex formation may 
be assayed The ability to hybridize will depend on both the degree of complementarity and 
the length of the antisense nucleic acid. Generally, the longer the hybridizing nucleic acid, the 
more base mismatches with an RNA it may contain and still form a stable duplex (or triplex, 
as the case may be). One skilled in the art can ascertain a tolerable degree of mismatch by use 
of standard procedures to determine the melting point of the hybridized complex. 

Oligonucleotides that are complementary to the 5' end of the message, for example, the 5' 
untranslated sequence up to and including the AUG initiation codon, should work most 
efficiently at inhibiting translation. However, sequences complementary to the 3 T untranslated 
sequences of mRNAs have been shown to be effective at inhibiting translation of mRNAs as 
well. Thus, oligonucleotides complementary to either the 5'- or 3 ! -non-translated, non-coding 
regions of the obesity can be used in an antisense approach to inhibit translation of 
endogenous obesity mRNA. Oligonucleotides complementary to the 5 1 untranslated region of 
the mRNA should include the complement of the AUG start codon. Antisense 
oligonucleotides complementary to mRNA coding regions are less efficient inhibitors of 
translation but could be used in accordance with the invention. Whether designed to hybridize 
to the 5'-, 3'- or coding region of obesity mRNA, antisense nucleic acids should be at least six 
nucleotides in length, and are preferably oligonucleotides ranging from 6 to about 50 
nucleotides in length. In specific aspects, the oligonucleotide is at least 10 nucleotides, at least 
17 nucleotides, at least 25 nucleotides, or at least 50 nucleotides in length. 
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Regardless of the choice of target sequence, it is preferred that in vitro studies are first 
performed to quantitate the ability of the antisense oligonucleotide to inhibit gene expression. 
It is preferred that these studies utilize controls that distinguish between antisense gene 
inhibition and nonspecific biological effects of oligonucleotides. It is also preferred that these 
studies compare levels of the target RNA or protein with that of an internal control RNA or 
protein. Additionally, it is envisioned that results obtained using the antisense oligonucleotide 
are compared with those obtained using a control oligonucleotide. It is preferred that the 
control oligonucleotide is of approximately the same length as the test oligonucleotide and 
that the nucleotide sequence of the oligonucleotide differs from the antisense sequence no 
more than is necessary to prevent specific hybridization to the target sequence. 

While antisense nucleotides complementary to the obesity coding region sequence could 
be used, those complementary to the transcribed untranslated region are most preferred. The 
antisense molecules should be delivered to cells which express the obesity in vivo. A number 
of methods have been developed for delivering antisense DNA or RNA to cells; for example, 
antisense molecules can be injected directly into the tissue site, or modified antisense 
molecules, designed to target the desired cells (e.g., antisense linked to peptides or antibodies 
that specifically bind receptors or antigens expressed on the target cell surface) can be 
administered systemically. 

Ribozyme molecules designed to catalytically cleave obesity mRNA transcripts can also 
be used to prevent translation of obesity mRNA and expression of obesity. (See, e.g., PCT 
International Publication WO90/1 1364, published Oct. 4, 1990; Sarver et al 9 1990, Science 
247:1222-1225). While ribozymes that cleave mRNA at site specific recognition sequences 
can be used to destroy obesity mRNAs, the use of hammerhead ribozymes is preferred. 
Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form 
complementary base pairs with the target mRNA. The sole requirement is that the target 
mRNA have the following sequence of two bases: 5'-UG-3\ The construction and production 
of hammerhead ribozymes is well known in the art and is described more fully in Haseloff 
and Gerlach, 1988, Nature 334:585-591. There are hundreds of potential hammerhead 
ribozyme cleavage sites within the nucleotide sequence of human obesity cDNA. Preferably 
the ribozyme is engineered so that the cleavage recognition site is located near the 5' end of 
the OBESITY mRNA; i.e. s to increase efficiency and minimiz e the intracellular accumulation 
of non-functional mRNA transcripts. 
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The ribozymes of the present invention also include RNA endoribonucleases 
(hereinafter "Cech-type ribozymes") such as the one which occurs naturally in Tetrahymena 
Thennophila (known as the IVS, or L-19 IVS RNA) and which has been extensively 
described by Thomas Cech and collaborators (Zaug, et aL, 1984, Science 224:574-578; Zaug 
and Cech, 1986, Science 231:470-475; Zaug, et aL, 1986, Nature 324:429-433; published 
International patent application No. WO 88/04300 by University Patents Inc.; Been and Cech, 
1986, Cell 47:207-216). The Cech-type ribozymes have an eight basepair active site that 
hybridizes to a target RNA sequence, whereafter cleavage of the target RNA takes place. The 
invention encompasses those Cech-type ribozymes that target eight basepair active site 
sequences that are present in obesity. 

As in the antisense approach, the ribozymes can be composed of modified 
oligonucleotides (e.g. for improved stability, targeting, etc.) and should be delivered to cells 
which express the obesity in vivo. A preferred method of delivery involves using a DNA 
construct "encoding" the ribozyme under the control of a strong constitutive pol EI or pol II 
promoter, so that transfected cells will produce sufficient quantities of the ribozyme to destroy 
endogenous obesity messages and inhibit translation. Because ribozymes, unlike antisense 
molecules, are catalytic, a lower intracellular concentration is required for efficiency. 

In an alternative embodiment for neutralizing circulating obesity, cells that are 
genetically engineered to express such soluble or secreted forms of obesity can be 
administered to a patient, whereupon they will serve as "bioreactors" in vivo to provide a 
continuous supply of the neutralizing protein. 

Endogenous obesity gene expression can also be reduced by inactivating or "knocking 
out" the obesity gene or its promoter using targeted homologous recombination. (E.g., see 
Smithies et aL 9 1985, Nature 317:230-234; Thomas & Capecchi, 1987, Cell 51:503-512; 
Thompson et aL, 1989, Cell 5:313-321; each of which is incorporated by reference herein in 
its entirety).. . 

In yet another embodiment of the invention, the activity of obesity can be reduced using a 
"dominant negative" approach to effectuate obesity. To this end, constructs that encode 
defective obesity can be used in gene therapy approaches to diminish the activity of the 
obesity in appropriate target cells. 

Pharmaceutical composition containing the Identified Candidate Compound 

The compounds having the desired activity may be administered in a physiologically 
acceptable carrier to an obesity patient. Such compositions of the invention may be in a form 
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suitable for oral use (for example as tablets, lozenges, hard or soft capsules, aqueous or oily 
suspensions, emulsions, dispersible powders or granules, syrups or elixirs), for topical use (for 
example as creams, ointments, gels, or aqueous or oily solutions or suspensions), for 
administration by inhalation (for example as a finely divided powder or a liquid aerosol), for 
administration by insufflation (for example as a finely divided powder) or for parenteral 
administration (for example as a sterile aqueous or oily solution for intravenous, 
subcutaneous, intramuscular or intramuscular dosing or as a suppository for rectal dosing). 

The compositions of the invention may be obtained by conventional procedures using 
conventional pharmaceutical excipients, well known in the art. Thus, compositions intended 
for oral use may contain, for example, one or more colouring, sweetening, flavouring and/or 
preservative agents. 

Suitable pharmaceutically acceptable excipients for a tablet formulation include, for 
example, inert diluents such as lactose, sodium carbonate, calcium phosphate or calcium 
carbonate, granulating and disintegrating agents such as corn starch or algenic acid; binding 
agents such as starch; lubricating agents such as magnesium stearate, stearic acid or talc; 
preservative agents such as ethyl or propyl p-hydroxybenzoate, and anti-oxidants, such as 
ascorbic acid. Tablet formulations may be uncoated or coated either to modify their 
disintegration and the subsequent absorption of the active ingredient within the 
gastrointestinal track, or to improve their stability and/or appearance, in either case, using 
conventional coating agents and procedures well known in the art. 

Compositions for oral use may be in the form of hard gelatin capsules in which the active 
ingredient is mixed with an inert solid diluent, for example, calcium carbonate, calcium 
phosphate or kaolin, or as soft gelatin capsules in which the active ingredient is mixed with 
water or an oil such as peanut oil, liquid paraffin, or olive oil. 

Aqueous suspensions generally contain the active ingredient in finely powdered form 
together with one or more suspending agents, such as sodium carboxymethylcellulose, 
methylcellulose, hydroxypropylmethylcellulose, sodium alginate, polyvinyl-pyrrolidone, gum 
tragacanth and gum acacia; dispersing or wetting agents such as lecithin or condensation 
products of an alkylene oxide with fatty acids (for example polyoxethylene stearate), or 
condensation products of ethylene oxide with long chain aliphatic alcohols, for example 
heptadecaethyleneoxycetanol, or condensation products of ethylene oxide with partial esters 
derived from fatty acids and a hexitol such as polyoxyethylene sorbitol monooleate, or 
condensation products of ethylene oxide with long chain aliphatic alcohols, for example 
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heptadecaethyleneoxycetanol, or condensation products of ethylene oxide with partial esters 
derived from fatty acids and a hexitol such as polyoxyethylene sorbitol monooleate, or 
condensation products of ethylene oxide with partial esters derived from fatty acids and 
hexitol anhydrides, for example polyethylene sorbitan monooleate. The aqueous suspensions 
may also contain one or more preservatives (such as ethyl or propyl p-hydroxybenzoate, anti- 
oxidants (such as ascorbic acid), colouring agents, flavouring agents, and/or sweetening 
agents (such as sucrose, saccharine or aspartame). 

Oily suspensions may be formulated by suspending the active ingredient in a vegetable 
oil (such as arachis oil, olive oil, sesame oil or coconut oil) or in a mineral oil (such as liquid 
paraffin). The oily suspensions may also contain a thickening agent such as beeswax, hard 
paraffin or cetyl alcohol. Sweetening agents such as those set out above, and flavouring 
agents may be added to provide a palatable oral preparation. These compositions may be 
preserved by the addition of an anti-oxidant such as ascorbic acid 

Dispersible powders and granules suitable for preparation of an aqueous suspension by 
the addition of water generally contain the active ingredient together with a dispersing or 
wetting agent, suspending agent and one or more preservatives. Suitable dispersing or 
wetting agents and suspending agents are exemplified by those already mentioned above. 
Additional excipients such as sweetening, flavouring and colouring agents, may also be 
present. 

The pharmaceutical compositions of the invention may also be in the form of 
oil-in-water emulsions. The oily phase may be a vegetable oil, such as olive oil or arachis oil, 
or a mineral oil, such as for example liquid paraffin or a mixture of any of these. Suitable 
emulsifying agents may be, for example, naturally-occurring gums such as gum acacia or gum 
tragacanth, naturally-occurring phosphatides such as soya bean, lecithin, an esters or partial 
esters derived from fatty acids and hexitol anhydrides (for example sorbitan monooleate) and 
condensation products of the said partial esters with ethylene oxide such as polyoxyethylene 
sorbitan monooleate. The emulsions may also contain sweetening, flavouring and 
preservative agents. 

Syrups and elixirs may be formulated with sweetening agents such as glycerol, propylene 
glycol, sorbitol, aspartame or sucrose, and may also contain a demulcent, preservative, 
flavouring and/or colouring agent. 

The pharmaceutical compositions may also be in the form of a sterile injectable aqueous 
or oily suspension, which may be formulated according to known procedures using one or 
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more of the appropriate dispersing or wetting agents and suspending agents, which have 
been mentioned above. A sterile injectable preparation may also be a sterile injectable 
solution or suspension in a non-toxic parenterally-acceptable diluent or solvent, for example a 
solution in 1 ,3-butanediol. 

Suppository formulations may be prepared by mixing the active ingredient with a suitable 
non-irritating excipient which is solid at ordinary temperatures but liquid at the rectal . 
temperature and will therefore melt in the rectum to release the drug. Suitable excipients 
include, for example, cocoa butter and polyethylene glycols. 

Topical formulations, such as creams, ointments, gels and aqueous or oily solutions or 
suspensions, may generally be obtained by formulating an active ingredient with a 
conventional, topically acceptable, vehicle or diluent using conventional procedure well 
known in the art.. 

Compositions for administration by insufflation may be in the form of a finely divided 
powder containing particles of average diameter of, for example, 30\i or much less, the 
powder itself comprising either active ingredient alone or diluted with one or more 
physiologically acceptable carriers such as lactose. The powder for insufflation is then 
conveniently retained in a capsule containing, for example, 1 to 50mg of active ingredient for 
use with a turbo-inhaler device, such as is used for insufflation of the known agent sodium 
cromoglycate. 

Compositions for administration by inhalation may be in the form of a conventional 
pressurised aerosol arranged to dispense the active ingredient either as an aerosol containing 
finely divided solid or liquid droplets. Conventional aerosol propellants such as volatile 
fluorinated hydrocarbons or hydrocarbons may be used and the aerosol device is conveniently 
arranged to dispense a metered quantity of active ingredient. 

For further information on Formulation the reader is referred to Chapter 25.2 in Volume 5 
of Comprehensive Medicinal Chemistry (Corwin Hansch; Chairman of Editorial Board), 
Pergamon Press 1990. 

The amount of active ingredient that is combined with one or more excipients to produce 
a single dosage form will necessarily vary depending upon the host treated and the particular 
route of administration. For example, a formulation intended for oral administration to 
humans will generally contain, for example, from 0.5 mg to 2 g of active agent compounded 
with an appropriate and convenient amount of excipients which may vary from about 5 to 
about 98 percent by weight of the total composition. Dosage unit forms will generally contain 
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about 1 mg to about 500 mg of an active ingredient. For further information on Routes of 
Administration and Dosage Regimes the reader is referred to Chapter 25.3 in Volume 5 of 
Comprehensive Medicinal Chemistry (Corwin Hansch; Chairman of Editorial Board), 
Pergamon Press 1990. 
5 The size of the dose for therapeutic or prophylactic purposes of a compound will 
naturally vary according to the nature and severity of the conditions, the age and sex of the 
animal or patient and the route of administration, according to well known principles of 
medicine. 

In using a compound for therapeutic or prophylactic purposes it will generally be 
10 administered so that a daily dose in the range, for example, 0.5 mg to 75 mg per kg body 
weight is received, given if required in divided doses. In general lower doses will be 
administered when a parenteral route is employed. Thus, for example, for intravenous 
administration, a dose in the range, for example, 0.5 mg to 30 mg per kg body weight will 
generally be used. Similarly, for administration by inhalation, a dose in the range, for 
15 example, 0.5 mg to 25 mg per kg body weight will be used Oral administration is however 
preferred. 

It will be obvious to those skilled in the art to which the invention pertains, that 
various changes and modifications may be made without departing from the scope of the 
invention defined by the claims. 

20 

EXAMPLES 
Example 1 

Identification of translocation breakpoints 

A male patient having a BMI of 48 was karyotyped and abnormal chromosomes were 

25 identified. It was found that the abnormal chromosomes had the translocation products 

between chromosome 4q22 and chromosome 15q22. The patient's sister and mother also had 
abnormally high BMI, and were also karyotyped (Fig. 1). It was subsequently found that the 
sister and mother were also carrying the balanced translocation. The breakpoints of each 
chromosome were further mapped with fish probes and southern blots and the breakpoints on 

30 each chromosome were sequenced. Using bacterial artificial chromosomes (BAC) clones 
spanning both breakpoints, the breakpoints were sequenced and assembled with the help of 
the public sequence to generate ordered contigs of the breakpoints region of chromosome 4 
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and 15. RORal on chromosome 15 was found to be affected since the breakpoint is within 
the first intron of isoform 1 resulting in the destruction of one copy of RORal. 
Example 2 

Gene expression analysis 

Preadipocytes were isolated from the affected boy and girl and used to study adipocyte 
function. Gene expression was monitored for all genes mapped to the vicinity of the 
breakpoints both at the preadipocyte stage and after induction into mature adipocytes. The 
isolated adipocytes showed an altered phenotype compared with normal adipocytes both in 
general morphology, function and gene expression. Lipid droplets were incorporated as tiny 
droplets instead of the large ones normally seen. UNC5C was highly upregulated in the 
preadipocytes both before and after induction into mature adipocytes when compared with an 
adipocyte control cell line. 

Two differently sized UNC5C transcripts could be detected in preadipocytes isolated 
from the patients. These were cloned and sequenced and shown to be a fusion between 
RORal exon 1 and UNC5C exon 2 giving rise to a novel protein. - Both encoded a fusion 
transcript with the structure RORal -UNC5C fusion (SEQ ID NO: 1 and SEQ ID NO: 3). The 
shorter of the two(SEQ ED NO: 3) is a splice version using alternative splice sites within 
exons 1 1 and 12 of UNC5C. 
Patient Material 

All three patients from the family were carriers of the translocation. The family consisted 
of affected mother and two affected siblings, the father is normal build. Lymphoblastoid cell 
lines were established according to standard procedures. 
FISHProbes 

For yeast artificial chromosome (YAC) screening, YACs from chromosome 4 and 15 
from the CEPH human YAC library was used (YAC clones chromosome 4: 244C10, 676B9, 
767G2, 783H11, 840F2, 854F4, 858A2,l71Al, 881 A12 and chromosome 15: 667B2, 
798C9, 801B4, 809G4, 827C7, 848E5, 854A11, 895G6, 897E9, 934G1, 937E4, 951F3). STS 
markers contained in the spanning YAC clones was used for PCR screening of bacterial 
artificial chromosome (BAC) libraries. 

The CEPH Human BAC library was screened using STS markers D4S2407 and D4S1 559 
from YAC clone 858A2 on chromosome 4 and WI-7454 from YAC clone 798C9 on 
chromosome 15. STS marker AFMa244wc9 from chromosome 15 was used to screen BAC 
Human II library from Genome Systems, Inc. 
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Restriction digestion with BamHI and HindEn was used to assemble a contig of the 
B AC clones and flanking clones from each contig were selected for BAC end walk. 
BAC end sequencing was performed on whole BACs purified with Qiagen Large Construct 
Purification kit (Qiagen) using sp6 and T7 primers. 

The end sequences of the BACs were used to screen the nr (non-redundant) and htgs 
(high throughput genomic sequences) databases at the NCBI/NIH server using the BLASTN 
program. Positive BAC hits were ordered from B ACPAC Resources (Children's Hospital 
Oakland) and used as FISH probes. 

To further map the breakpoints, sequences of approximately 6 kb from the spanning 
BACs RP1 1-402L1 1 (AC093832) and RP1 1-90A19 (AC079068) was derived by long range 
PGR and used as probes for FISH. 

Mini FISH probes used for chromosome 4: 



Forward primer 


Reverse primer 


SEQIDNO: 15 


SEQIDNO: 16 


SEQIDNO: 17 


SEQIDNO: 18 


SEQ ID NO: 19 


SEQIDNO: 20 


SEQIDNO: 21 


SEQ ID NO: 22 


SEQ ID NO: 23 


SEQ ID NO: 24' 


SEQIDNO: 25 


SEQ ID NO: 26 



Mini FISH probes used for chromosome 15: 



Forward primer 


Reverse primer 


SEQIDNO: 27 


SEQIDNO: 28 


SEQ ID NO: 29 


SEQ ID NO: 30 


SEQ ID NO: 31 


SEQ ID NO: 32 


SEQ ID NO: 33 


SEQIDNO: 34 


SEQ ID NO: 35 


SEQ ID NO: 36 
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SEO ID NO: 37 


SEQ ID NO: 38 


SEQIDNO: 39 


SEQ ID NO: 40 


SEQ ID NO: 41 


SEQ ID NO: 42 


SEQIDNO: 43 


SEQ ID NO: 44 



FISH analysis 



FISH was performed on metaphase chromosome spreads from the boy which is carrying 
the translocation. Metaphase spreads were prepared from lymphocytes according to standard 
methods. 



Southern blot 

The probes used in southern blot were amplified by PCR from BAC clones RP1 1-402L1 1 
(chromosome 4) and RP1 1-90A19 (chromosome 15) and labelled by random priming. 
The probes used for chromosome 4 were: 



Forward primer 


Reverse primer 


SEQ ID NO: 45 


SEQ ID NO: 46 


SEQ ID NO: 47 


SEQIDNO: 48 


SEQ ID NO: 49 


SEQ ID NO: 50 


SEQ ID NO: 51 


SEQIDNO: 52 


SEQ ID NO: 53 


SEQIDNO: 54 



The probes used for chromosome 15 were: 



Forward primer 


Reverse primer 


SEO ID NO: 55 


SEQIDNO: 56 


SEQ ID NO: 57 


SEQIDNO: 58 


SEO ID NO: 59 


SEQIDNO: 60 


SEQ ID NO: 61 


SEQ ED NO: 62 
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PCR and Sequencing of the Breakpoints 

The breakpoints could be bridged by long range PCR with chromosome derivate specific 
primers. For the chromosome 4 derivate the chromosome 4 primer SEQ ID NO: 63 and the 
chromosome 1 5 primer SEQ ID NO: 64 was used, and for the chromosome 1 5 derivate the 
chromosome 15 primer SEQ ED NO: 65 and the chromosome 4 primer SEQ ID NO: 66 was 
used. 

The breakpoint on the chromosome 4 derivate PCR product could be sequenced using 
primer SEQ ID NO: 51 and the chromosome 15 derivate PCR product could be sequenced 
using primer SEQ ID NO: 52. 
RNA isolation 

Total RNA was isolated from cells using Trizole reagent (Gibco) according to the 
manufacturer's description. The isolated RNA was dissolved in DEPC treated water and 
concentration was determined by spectrophotometric measurement at 260nm. All samples 
were also electrophoresed in 1% agarose gel to assess integrity of ribosomal bands. 
DNAse treatment and cDNA synthesis 

1 5pg of total RNA was treated with DNase to remove contaminating genomic DNA 
before cDNA synthesis. DNA-free™ from Ambion was used for this purpose. Shortly; 0.1 
volume of 10 times DNase buffer, 6 units of DNase and water up to 30pl was added to each 
sample. Samples were incubated at 37°for 40 minutes. DNase inactivation reagent was added 
to each sample , and after a 2 minutes incubation at room temperature it was removed by 
centrifugation. 

First strand cDNA synthesis was performed with SuperScript™First-Strand Synthesis. 
System for RT-PCR (Invitrogen, Life Technologies) according to enclosed protocol. 2jig of 
DNase treated total RNA was used in each synthesis. Priming of synthesis was accomplished 
with 01igo(dT)i2-i8 . For each synthesis reaction a negative control with no enzyme was set up. 
PCR and cloning 

Full length cDNA of the fusion transcript was obtained with TaqPlus® Precision PCR 
System( Stratagene). In a total of 50pl, 1 time TaqPlus Precision buffer, 200pM of each 
dNTP (Applied Biosystems), 300nM forward primer SEQ ID NO: 67, 300nM rpverse primer 
SEQ ID NO: 68 and 2.5U TaqPlus Precision polymerase mixture. cDNA from above was 
used as template, 0.4pl/reaction (equivalent with 40ng total RNA). The reaction mixture was 
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then subjected to thermal cycling in an PTC-200 from MJ research. Thermal cycling 
included one step initial denaturation one minute at 95°C, 35 cycles with 95°C one minute, 
58°C one minute, 72°C three minutes and a final extension often minutes at 72°. The reaction 
mixture was then loaded on a 1% Low melt Agarose gel (BioRad). Inspection of the gel 
5 showed 2 bands of approximately 2.8 and 3 kb. Both bands were cut out and ligated into 
pcR2.1-TOPO® (Invitrogen) and sequentially transfected into TOP 10 (Invitrogen). 

Both constructs were sequenced using Big Dye chemistry and 3 100 Genetic 
Analyzer(Applied Biosystems). 
Quantification 

10 cDNA from above was also used for relative quantification on Applied Biosystems 7700. 
Quantitative real-time PCR was carried out using SYBR® Green PCR Master Mix in 
25>iL reactions run in triplicates. Template for the PCR was first strand cDNA from above 
(amount equivalent of lOng total RNA). Forward and reverse primers specific for UNC5c was 
used at a concentration of 400nM. As negative control for amplification of remaining 
15 genomic DNA, no enzyme controls from cDNA synthesis was used. 
Shotgun library construction 

Human Bac clones where cultured over night in LB-media. DNA was prepared using 
Maxi-plasmid prep from Qiagen (VWR). 
5pg purified DNA was fragmented with a nebulizer. 
20 DNA was fragmented to 1 to 3 kb fragments, checked on 1% agarose gel. 

All according to the manufacturer's description in TOPO ® Shotgun Sub cloning Kit 
(Invitrogen). DNA was Blunt-end repaired, dephosphorylated and transformed into One Shot 
competent E.coli, also from the kit 

The transformation reaction was plated on LB plates and incubated over night 
25 A colony picker Q-PDC (Genetix) picked the colonies into 96-well micro plates with LB- 
media. 

One more incubation 37°C over night. 

Hydra, a dispenser moved the bacterial suspension to 96-deep well plates and a 34-hour 
37°C shake incubation was performed. After centrifugation, continued with DNA preparation 



30 
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DNA preparation 

Add 50 pi water, vortex 15 minutes. When complete resuspension add 70 pi Stet- 
Tween20 with 5,2mg/ml Rnase and 8,4mg/ml Lysozyme. Denaturate in microwave 
1 minute add 335 pi water, vortex and let stand in ice for 10 minutes. Spin boxes for 30 
5 minutes x 4000rpm. The clear supernatant moves to new 96- well micro plates to be 
sequenced. 
Sequencing 

Big Dye chemistry primer T3, primer T7 and ABI PRISM 3700 DNA Analyzer (Applied 
Biosystems) was used for sequencing clones. 
10 2x96 plate 

Big Dye mix 220pl (v.2.0) 

Water 220pl 

Buffer x 2,5 660pl ( 200mMTris-HCl ph 9.0, 5mM MgC12) 

Primer T7 or T3 7jxl ( lOOpmol/pl) 

15 Robot Biomek 2000, loaded the mix and the DNA samples into PCR-plates. 

PTC-200, (MJ Research) the thermal cycling instrument; 

95°C x 2', (95°C x 10", 50°C x 10", 60°C x 4') x 35 cycler and 8°C. 

Followed by centrifiigation and ethanol cleaning up step. 

1 5jil water was added before running on 3700. 
20 5'RACEofRORa 

Template used was Marathon Ready cDNA from Clontech (Human Adipocyte #7447(pooled 
from 11 male/female Caucasians, ages 19-57). 

Nested PCR was used to obtain discrete bands from the PCR reactions. In both reactions 
following concentrations was used; forward aiid reverse primer 200nM, dNTP 200pM, MgCl 
25 2mM, 0. 1 25U/jiL AmpliTaq Gold (Applied Biosystems) all in one time PCR buffer II 
(Applied Biosystems). 

In the first round, forward primer was API (linker primer, Clontech) (SEQ ID NO: 69). 
Reverse primer, situated in the RORalpha sequence (SEQ ID NO: 70). Template 2pL 
(~0.1ng/pL, Marathon Ready cDNA). This reaction mixture was then subjected to thermal 
30 cycling in a PTC-200 from MJ research. Thermal cycling included one step initial 
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denaturation of eight minutes at 95 °C, 30 cycles with 94°C fortyfive seconds, 66°C fortyfive 
seconds, 72°C one minute and a final extension of ten minutes at 72°. 
In the second round of PCR, forward primer was AP2 (linker primer, Clontech) (SEQ ID NO: 
71). Reverse primer was the same as was used in the first round of PCR (SEQ ID NO: 70). 
Temlate was lpL of a twenty times dilution of the first PCR-reaction. This mixture was 
subjected to another 30 cycles, as above, but with 64 instead of 66°C as annealing 
temperature. 

PCR reactions were loaded on a low melt agarose gel (1% from BioRad). Potentially 
interesting bands were chosen and cut out from the gel, slices melted at 65° and ligated into 
pcR2.1-TOPO® (Invitrogen). Ligation mixture was then used to transfect TOP 10 
(Invitrogen). 

Two clones from each band was then sequenced using Big Dye chemistry and 3 100 
Genetic Analyzer(Applied Biosystems). 
Taqman analysis of RORaS 

Primers where designed with Applied Biosystems program Primer Express 2.0 
Forward primer Tm 58,5°C was SEQ ID NO: 72. Reverse primer Tm 58,5°C was SEQ ID 
NO: 73. Template poly A+RNA from the following human tissues was obtained from 
Clontech; Bone marrow, brain, kidney, liver, lung, skeletal muscle, small intestine, spleen, 
testis and fat. CDNA synthesis was made with Superscript™ First-Strand Synthesis System 
for RT-PCR, Invitrogen™ according to the protocol supplied by the manufacturer. 
Real time PCR was performed according to protocol from the manufacturer and run on ABI 
Prism 7700® Sequence Detection System, Applied Biosystems. Results analysed with 
Sequence detector VI. 7 

PCR detection ofUNCSC splice variant transcript 

Forward primer was SEQ ID NO: 74 spanning bases 1586-1604 of human UNC5C 
sequence AF055634. Reverse primer was SEQ ID NO 75 spanning 2386-2367 of transcript 
AF055634. Template was cDNA from human brain (mRNA from Clontech converted into 
cDNA using "Superscript™ First-Strand Synthesis System for RT-PCR" (Invitrogen)) 
according to protocol from the manufacturer. This PCR resulted in 3 bands. The expected one 
at ~800bp, a very faint one that was slightly shorter and one at ~550bp. The 800 and 550 
bands were cut out of the gel and purified using Qiaquick gel extraction kit (Qiagen) and 
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then sequenced using Big Dye chemistry and 3100 Genetic Analyzer (Applied Biosysterus). 
The 800 band was the expected UNC5c sequence i.e. partial UNC5c spanning bases 1586- 
2386. The 550bp fragment was UNC5C with a deletion of bases 1873-2078. This deletion is 
identical to the deletion in the sequence of UNC5Cb with accession number AAC90914. 
Bioinformatics 

Sequence assembly and alignment of gene sequences was made using Sequencer software 
from Gene codes corporation version 4. 1 2 on a Macintosh G4. 
Primary cell culture 

The following products were purchased from BioWhittaker (MD, USA): DMEM F-12, 
FBM (fibroblast basal medium), human fibroblast growth factor-P (hFGF-P; 1 jig/nil), 
Trypsin EDTA (200 mg/ml and 500 mg/ml), Fungizone (250pg/ml). Foetal calf serum (FCS) 
was provided from Harlan Sera lab (Leicestershire, UK). Pioglitasone (X-46) was from 
AstraZeneca R&D MolndaL 

0.008M Phosphate buffered saline (PBS); insulin and transferrin were obtained from Life 
Technologies. BSA fraction V, Collagenase (C-6885), BBMX (3-isobutyl-l-methylxanthine), 
dexamethasone, 0.05M Tris Buffered Saline pH 8.0 were purchased from SIGMA. P- 
mercaptoethanol and glucose were from Kebo lab. Primaria™ tissue culture flasks were from 
Falcon. Nile Red was from Molecular Probes. 

Biopsies from humans were taken under aseptic conditions and transferred to 
DMEM/F12 supplemented with 25 jig/ml Gentamycin and Fungizone (2.5 pg/ml). The 
adipose tissue was digested in DMEM F-12 containing albumin (1 mg/ml), collagenase (1 
mg/ml), glucose (2 mg/ml), 25 pg/ml Gentamycin and Fungizone (2.5 pg/ml) in a shaking 
waterbath at 37°C for 30-45 min. The cell suspension was then filtered through a nylon mesh 
(lOOjarn) and centrifuged at 650g for 10 minutes. The sedimented cells, preadipocytes, were 
resuspended and washed twice in DMEM F-12 supplemented with 25 pg/ml Gentamycin and 
Fungizone (2.5 jig/ml). After the last wash the cells were resuspended in FBM (fibroblast 
medium) supplemented with 10% FCS, 25 pg/ml Gentamycin and Fungizone (2.5 |ig/mL, 
0.1% 0.1M mercaptoethanol, 10 mg/ml insulin, 10 mg/ml transferrin and 0.001 pig/ml hFGF- 
P The cells was seeded and incubated in small Primaria culture flasks (25 cm 2 ) over night 
followed by washing with PBS and culture continued until the cells reached confluence. 
Thereafter the cells were washed in PBS (without Mg 44 " and Ca 44 ), treated with trypsin EDTA 
for a few minutes, resuspended and splited in fresh FBM and seeded into new small culture 
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flasks. The medium was changed every day. The new cell lines were named PSc020108 
(male) and Asc020 108 (female). 

The fraction with the mature adipocytes (after protease treatment) was frozen in -80°C for 

further RNA preparation. 

Differentiation 

The undifferentiated preadipocytes were seeded into small cell culture flasks and cultured 
in DMEM/F12 supplemented with 10% FCS, 1% PEST, 10 pg/ml insulin, 10 jig/ml 
transferrin and 0.1 mM P-mercaptoethanol (referred to as standard medium). When the cells 
have reached confluence the differentiation was induced by adding standard medium 
supplemented with 0.25pM dexamethasone and 0.5 mM EBMX (referred to as induction 
medium 1) for three days. To maintain and extend the differentiation process the medium was 
changed to standard medium containing 10 pM X-46 and 0.25pM Dexamethasone (referred 
to as induction medium 2) for 2 days. This process was repeated once and after that the 
culture process was continued only with standard medium supplemented with 0.25pM 
dexamethasone (referred to as maintenance mediuml). 
The lipid accumulation in the adipocytes was confirmed by Nile Red staining. 
Staining cells with Nile Red 

Nile red is a benzophenoxazone dye. In older chemical and histochemical literature the 
dye sometimes referred to as nileblue A-oxazone. Nile red is poorly soluble in water but does 
dissolve in a wide variety of organic solvents. 

A Img/ml stock solution of Nile red in acetone were prepared and stored protected from 
light. Staining could be carried out on either fixed (1 .5% glutaraldehyde in PBS, for 5 min) or 
unfixed cells. Attached unfixed cultured cells were covered with PBS. The dye was then 
added directly to the flask to effect a 1 : 100 dilution to yield a final concentration of 10 pg/ml. 
The preparation was incubated for a minimum of 5 min. For fluorescence microscopy excess 
dye was removed by briefly rinsing the cells with PBS. The suspension medium must not 
contain serum or albumin since they could act as a sink to draw Nile red out of the cells. Dye 
in the medium does not interfere with the observations since Nile red fluorescence is 
quenched in water. Nile red is not toxic for the cells so it is possible to continue the culture 
after staining when unfixed cells are used. 
Morphology and Fluorescence microscopy 

The differentiation of adipocyte precursors into mature adipocytes was assessed by 
conventional microscopy at a 100-200-fold magnification. Cells were considered 
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differentiated when their cytoplasm was filled with lipid droplets, clearly seen in phase 
contrast. The fluorescence microscopy studies was carried out using a Zeiss axiovert 25 
inverted microscope, equipped with a HBO 50 illuminator and with a CP-Achromat 1 Ox/0.25 
phi /Varl objective lens and the excitation wavelength was 485 nm (filter set 487909). 
5 The stained cells were photographed in color using the AxioVision system from Carl 
Zeiss. The pictures were processed in Adobe PhotoShop 6.0. 
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CLAIMS 

1 . An isolated nucleic acid molecule comprising a nucleotide sequence having at least 65% 
identity to a degenerate variant of SEQ ID NO: 1 , SEQ E) NO: 3 or SEQ ID NO: 7. 

5 

2. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
comprises the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO: 3 or SEQ ID NO: 7. 

3. An isolated nucleic acid molecule comprising the complement of SEQ ID NO: 1, SEQ ID 
10 NO: 3 or SEQ ID NO: 7. 

4. An isolated nucleic acid molecule encoding a RORal-UNC5C polypeptide, said nucleic 
acid molecule comprising a nucleotide sequence having at least 65% identity to a degenerate 
variant of SEQ ID NO: 1 or SEQ ID NO: 3. 

15 

5. An isolated nucleic acid molecule encoding a RORocS polypeptide, said nucleic acid 
molecule comprising a nucleotide sequence having at least 65% identity to a degenerate 
variant of SEQ ID NO: 7. 

20 6. A vector comprising the nucleic acid molecule of any of claims 1-5. 

7. A host cell comprising the vector of claim 6. 

8. A purified fusion polypeptide of the RORal-UNC5C polypeptide comprising an amino 
25 acid sequence having at least 65% identity to the amino acid sequence of SEQ ID NO: 2 or 

SEQ ID NO: 4. 

9. The purified polypeptide of claim 8, wherein the amino acid sequence comprises the amino 
acid sequence of SEQ ID NO: 2 or SEQ ID NO:4. 

30 

10. A purified polypeptide of the RORocS polypeptide comprising an amino acid sequence 
having at least 65% identity to the amino acid sequence of SEQ ID NO: 8. 
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11. The purified polypeptide of claim 10, wherein the amino acid sequence comprises the 
amino acid sequence of SEQ ID NO: 8. 

12. A method for producing a protein comprising: 

5 a) culturing the host cell of claim 7 under conditions whereby the protein is produced, 
and 

b) recovering the protein from the host cell culture. 

1 3 . A method for detecting a polynucleotide which encodes a protein comprising the amino 
10 acid sequence of SEQ ID NO:2, SEQ ID NO:4 or SEQ ID NO: 8 in a biological sample 

comprising the steps of: 

a) hybridizing a complement of the polynucleotide sequence which encodes SEQ ID NO: 
2, SEQ ID NO:4 or SEQ ID NO: 8 to a nucleic acid material of a biological sample, thereby 
forming a hybridization complex; and 
15 b) detecting the hybridization complex, wherein the presence of the complex correlates 
with the presence of a polynucleotide encoding a protein in the biological sample. 

14: A method for detecting the presence of an obesity susceptibility gene comprising: 
determining the identity of a gene located 20 cM, or less, adjacent to the 
20 translocation breakpoint between chromosome 4 at cytoband 4q22.3 and chromosome 15 at 
cytoband 15q22.2; and 

determining if the gene has aberrant gene expression as a consequence of the 
translocation event 

25 15. A method for detecting the presence of a translocation junction between chromosome 4 at 
cytoband 4q22.3 and chromosome 15 at cytoband 15q22.2 comprising analysing a sample of 
DNA from an individual for the presence of the translocation junction. 

16. A method for identifying a test compound that modulates the expression of an obesity 
30 susceptibility gene identified in claim 14 comprising: 

contacting a cell capable of expressing the susceptibility gene with a test compound; and 
determining the level of expression of the obesity susceptibility gene in the presence of 

the test compound, wherein a decrease or an increase in expression of a obesity susceptibility 
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gene, as compared to the level of expression of an obesity susceptibility gene in the absence 
of the compound, is indicative that the test compound modulates the expression of the obesity 
susceptibility gene. 

5 1 7. A method of identifying a test compound that modulates the activity of an obesity protein 
encoded by the obesity susceptibility gene identified in claim 14, comprising: 
contacting the obesity protein with a test compound; and 

determining the level of activity of the obesity protein in the presence of the compound, 
wherein a decrease or an increase in obesity protein activity, as compared to the level of 
10 activity of the obesity protein in the absence of the compound, is indicative that the test 
compound modulates obesity protein activity. 

1 8. A method of treating a subject having obesity comprising admMstering an effective 
amount of the compound identified in claims 1 6 or 1 7. 

15 . 

19. Use of a compound identified in claims 16 or 17 in the preparation of a medicament for 
the treatment of obesity. 

20. A pharmaceutical composition comprising a compound identified in claims 16 or 17, and 
20 a pharmaceutical^ acceptable adjuvant, diluent or carrier. 

21. A method of making a pharmaceutical composition comprising: 

contacting a cell capable of expressing an obesity susceptibility gene with a test 
compound; 

25 determining the level of expression of the obesity susceptibility gene in the presence of 
the test compound, wherein a decrease in expression of a obesity susceptibility gene, as 
compared to the level of expression of a obesity susceptibility gene in the absence of the 
compound, is indicative that the test compound decreases obesity susceptibility gene 
expression; and 

30 formulating the test compound that decreases obesity susceptibility gene expression into a 
pharmaceutical composition. 
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22. A method of making a pharmaceutical composition comprising: 
contacting an obesity protein with a test compound; 

determining the level of activity of the obesity susceptibility protein in the presence of the 
compound, wherein a decrease in obesity protein activity, as compared to the level of activity 
5 of the obesity protein in the absence of the compound, is indicative that the test compound 
decreases obesity protein activity; and 

formulating the test compound that decreases obesity protein activity into a 
pharmaceutical composition. 

10 23. A method for determining if an obesity susceptibility gene identified in claim 14 has an 
altered level of gene expression comprising: 

comparing the level of obesity gene expression in a cell from a patient having obesity 
with a control cell, and 

determining the level of expression of the obesity susceptibility gene in both cells, 
15 wherein a decrease or an increase in expression of the obesity susceptibility gene, as 
compared to the level of expression of the obesity susceptibility gene in the control cell, 
indicates that the obesity susceptibility gene has altered gene expression. 

24. A method of diagnosing obesity, or a susceptibility thereto in a subject, the method 
20 comprising: 

determining the level of mRNA of UNC5C; and 

comparing the level of mRNA of UNC5C in the sample with a control, wherein an 
increase in the level of UNC5C in the sample compared to the control indicates that the 
subject has obesity , or a susceptibility thereto. 

25 

25. A method of diagnosing obesity or a susceptibility thereto in a subject, the method 
comprising: 

. determining the level of aUNCSC protein in a sample from a subject; and 
comparing the level of UNC5C in the sample with a control, wherein an increase in the 
30 level of the protein in the sample compared to the control indicates that the subject has 
obesity, or a susceptibility thereto. 
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26. A method of diagnosing obesity, or a susceptibility thereto in a subject, the method 
comprising analysing for the presence of RORlal-UNC5C mRNA, wherein the presence of 
the mRNA indicates that the subject has obesity, or a susceptibility thereto. 

27. A method of diagnosing obesity or a susceptibility thereto in a subject, the method 
comprising analysing for the presence of the RORlal-UNC5C fusion polypeptide, wherein 
the presence of the polypeptide indicates that the subject has obesity, or a susceptibility 
thereto. » 
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SEQUENCE LISTING 



<110> AstraZeneca AB 

5 <120> Molecules 

<130> CCH/100852 

<150> GB 0306185 

10 <151> 2003-03-19 



15 



<160> 



<170> Patentln version 3.2 



<210> 
<211> 
<212> 
<213> 



1 

2986 
OTA 

Homo sapiens 



20 

<400> 1 

ctgaaaacag aagatagaga ggagtctcgg agctcgccat ctccagcgat ctctacattg 60 

ggaaaaaaca tggagtcagc tccggcagcc cccgaccccg ccgccagcga gccaggcagc 120 

agcggcgcgg acgcggccgc cggctccagg gagaccccgc tgaaccagga atccgcccgc 180 

25 aagagcgagc cgcctgcccc ggtgcgcaga cagagctatt ccagcaccag cagagatgat 240 

gacttttttc atgaactccc agaaactttt ccttctgatc cacctgagcc tctgccacat 300 • 

ttccttattg agcctgaaga agcttatatt gtgaagaata agcctgtgaa c'ctgtactgt 360 

aaagcaagcc ctgccaccca gatctatttc aagtgtaata gtgaatgggt tcatcagaag 420 

gaccacatag tagatgaaag agtagatgaa acttccggtc tcattgtccg ggaagtgagc 480 

30 attgagattt cgcgccagca agtggaagaa ctctttggac ctgaagatta ctggtgccag 540 

tgtgtggcct ggagctccgc gggtaccaca aagagccgga aggcgtatgt gcgcattgca 600 

tatctacgga agacatttga gcaggaaccc ctaggaaagg aagtgtcttt ggaacaggaa 660 

gtcttactcc agtgtcgacc acctgaaggg atcccagtgg ctgaggtgga atggttgaaa 720 

aatgaagaca taattgatcc cgttgaagat cggaattttt atattactat tgatcacaac 780 

35 ctcatcataa agcaggcccg actctctgat actgcaaatt acacctgtgt tgccaaaaac 840 

attgttgcca agaggaaaag tacaactgcc actgtcatag tctatgtcaa cggtggctgg 900 

tccacctgga cggagtggtc tgtgtgtaac agccgctgtg gacgagggta tcagaaacgt 960 

acaaggactt gtaccaaccc ggcaccactc aatgggggtg ccttctgtga agggcagagt 1020 

gtgcagaaaa tagcctgtac tacgttatgc ccagtggatg gcaggtggac gccatggagc 1080 

40 aagtggtcta cttgtggaac tgagtgcacc cactggcgca ggagggagtg cacggcgcca 1140 

gcccccaaga atggaggcaa ggactgcgac ggcctcgtct tgcaatccaa gaactgcact 1200 

gatgggcttt gcatgcagag tttcatttat cccatttcaa ctgaacagag aacccagaat 1260 

gaatatggat tttcttctgc tcctgattca gatgatgttg ctctctatgt tgggattgtg 1320 

atagcagtga tcgtttgcct ggcgatctct gtagttgtgg ccttgtttgt gtatcggaag 1380 

45 aatcatcgtg actttgagtc agatattatt gactcttcgg cactcaatgg gggctttcag 1440 

cctgtgaaca tcaaggcagc aagacaagat ctgctggctg tacccccaga cctcacgtca 1500 

gctgcagcca tgtacagagg acctgtctat gccctgcatg acgtctcaga caaaatccca 1560 

atgaccaact ctccaattct ggatccactg cccaacctga aaatcaaagt gtacaacacc 1620 

tcaggtgctg tcacccccca agatgacctc tctgagttta cgtccaagct gtcccctcag 1680 

50 atgacccagt cgttgttgga gaatgaagcc ctcagcctga agaaccagag tctagcaagg 1740 

cagactgatc catcctgtac cgcatttggc agcttcaact cactgggagg tcaccttatt 1800 

gttcccaatt caggagtcag cttgctgatt cccgctgggg ccattcccca agggagagtc 1860 

tacgaaatgt atgtgactgt acacaggaaa gaaactatga ggccacccat ggatgactct 1920 

cagacacttt tgacccctgt ggtgagctgt gggcccccag gagctctgct cacccgcccc 1980 

55 gtcgtcctca ctatgcatca ctgcgcagac cccaataccg aggactggaa aatactgctc 2040 

aagaaccagg cagcacaggg acagtgggag gatgtggtgg tggtcgggga ggaaaacttc 2100 

accaccccct gctacattca gctggatgca gaggcctgcc acatcctcac agagaacctc 2160 

agcacctacg ccctggtagg acattccacc accaaagcgg ctgcaaagcg cctcaagctg 2220 

gccatctttg ggcccctgtg ctgctcctcg ctggagtaca gcatccgagt ctactgtctg 2280 

60 gatgacaccc aggatgccct gaaggaaatt ttacatcttg agagacagac gggaggacag 2340 

ctcctagaag aacctaaggc tcttcatttt aaaggcagca cccacaacct gcgcctgtca 2400 

attcacgata tcgcccattc cctctggaag agcaaattgc tggctaaata tcaggaaatt 2460 

ccattttacc atgtttggag tggatctcaa agaaacctgc actgcacctt cactctggaa 2520 

agatttagcc tgaacacagt ggagctggtt tgcaaactct gtgtgcggca ggtggaagga 2580 

65 gaagggcaga tcttccagct caactgcacc gtgtcagagg aacctactgg catcgatttg 2640 

ccgctgctgg atcctgcgaa caccatcacc acggtcacgg ggcccagtgc tttcagcatc 2700 

cctctcccta tccggcagaa gctctgtagc agcctggatg ccccccagac gagaggccat 2760 

gactggagga tgctggccca taagctgaac ctggacaggt acttgaatta ctttgccacc 2820 
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aaatccagcc caactggcgt aatcctggat ctttgggaag cacagaactt cccagatgga 2880 
aacctgagca tgctggcagc tgtcttggaa gaaatgggaa gacatgaaac ggtggtgtcc 2940 
ttagcagcag aagggcagta ttaaccacca tgctggaagg ggaaat 2986 

5 

<210> 2 

<211> 964 

<212> PRT 

<213> Homo sapiens 

10 

<400> 2 

Met Glu Ser Ala Pro Ala Ala Pro Asp Pro Ala Ala Ser Glu Pro Gly 
.1 5 10 15 

15 Ser Ser .Gly Ala Asp Ala Ala Ala Gly Ser Arg Glu Thr Pro Leu Asn 
.20 25 30 

Gin Glu. Ser Ala Arg Lys Ser Glu Pro Pro Ala Pro Val Arg Arg Gin 
35 40 45 

20 

Ser Tyr Ser Ser Thr Ser Arg Asp Asp Asp Phe Phe His Glu Leu Pro 
50 55 60 

Glu Thr Phe Pro Ser Asp Pro Pro Glu Pro Leu Pro His Phe Leu lie 
25 65 70 75 .80 

Glu Pro Glu Glu Ala Tyr lie Val Lys Asn Lys Pro Val Asn Leu Tyr . 

85 90 ' .95 

30 Cys Lys Ala Ser Pro Ala Thr Gin lie Tyr Phe Lys Cys Asn Ser Glu 
100 105 110 

Trp Val His Gin Lys Asp His lie Val Asp Glu Arg Val Asp Glu Thr 
115 120 125 . 

35 

Ser Gly Leu lie Val Arg Glu Val Ser lie Glu lie Ser Arg Gin Gin 
130 135 140 

Val Glu Glu Leu Phe Gly Pro Glu Asp Tyr Trp Cys Gin Cys Val Ala 
40 145 150 155 160 

Trp Ser Ser Ala Gly Thr Thr Lys Ser Arg Lys Ala Tyr Val Arg lie 
165 170 175 

45 Ala Tyr Leu Arg Lys Thr Phe Glu Gin Glu Pro Leu Gly Lys Glu Val 
180 185 190 

Ser Leu Glu Gin Glu Val Leu Leu Gin Cys Arg Pro Pro Glu Gly He 
195 200 205 

50 

Pro Val Ala Glu Val Glu Trp Leu Lys Asn Glu Asp He He Asp Pro 
210 215 220 

Val Glu Asp Arg Asn Phe Tyr He Thr He Asp His Asn Leu He He 
55 225 230 . 235 240 

Lys Gin Ala Arg Leu Ser Asp Thr Ala Asn Tyr Thr Cys Val Ala Lys 
245 250 255 

60 Asn He Val Ala Lys Arg Lys Ser Thr Thr Ala Thr Val He Val Tyr 
260 265 270 

Val Asn Gly Gly Trp Ser Thr Trp Thr Glu Trp Ser Val Cys Asn Ser 
275 280 285 

65 

Arg Cys Gly Arg Gly Tyr Gin Lys Arg Thr Arg Thr Cys Thr Asn Pro 
290 295 300 
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Ala Pro Leu.Asn Gly Gly Ala Phe Cys Glu Gly Gin Ser Val Gin Lye 
305 310 315 320 

lie Ala Cys Thr Thr Leu Cys Pro Val Asp Gly Arg Trp Thr Pro Trp 
5 . 325 330 335 

Ser Lys Trp Ser Thr Cys Gly Thr Glu Cys Thr His Trp Arg Arg Arg 
340 345 350 

10 Glu Cys Thr Ala Pro Ala Pro Lys Asn Gly Gly Lys Asp Cys Asp Gly 
355 360 . 365 

Leu Val Leu Gin Ser Lys Asn Cys Thr Asp Gly Leu Cys Met Gin Ser 
370 375 380 

15 

Phe lie Tyr Pro lie Ser Thr Glu Gin Arg Thr Gin Asn Glu Tyr Gly 
385 390 395 400 

Phe Ser Ser Ala Pro Asp Ser Asp Asp Val Ala Leu Tyr Val Gly lie 
20 405 410 415 

Val lie Ala Val He Val Cys Leu Ala He Ser Val Val Val Ala Leu 
420 425 .. 430 

25 Phe Val Tyr Arg Lys Asn His Arg Asp Phe Glu Ser Asp lie He Asp 
435 440 445 

Ser Ser Ala Leu Asn Gly Gly Phe Gin Pro Val Asn He Lys Ala Ala 
450 455 460 

30 

Arg Gin Asp Leu Leu Ala Val Pro Pro Asp Leu Thr . Ser Ala Ala Ala 
465 470 475 480 . 

Met Tyr Arg Gly Pro Val Tyr Ala Leu His Asp Val Ser Asp Lys He 
35 485 490 495 

Pro Met Thr Asn Ser Pro He Leu Asp Pro Leu Pro Asn Leu Lys He 
500 505 510 

40 Lys Val Tyr Asn Thr Ser Gly Ala Val Thr Pro Gin Asp Asp Leu Ser 
515 520 525 

Glu Phe Thr Ser Lys Leu Ser Pro Gin Met Thr Gin Ser Leu Leu Glu 
530 535 540 

45 

Asn Glu Ala Leu Ser Leu Lys Asn Gin Ser Leu Ala Arg Gin Thr Asp 
.545 550 555 560 

Pro Ser Cys Thr Ala Phe Gly Ser Phe Asn Ser Leu Gly Gly His Leu 
50 565 570 575 

lie Val Pro Asn Ser Gly Val Ser Leu Leu He Pro Ala Gly Ala He 
580 585 590 - 

55 Pro Gin Gly Arg Val Tyr Glu Met Tyr Val Thr Val His Arg Lys Glu 
595 600 605 

Thr Met Arg Pro Pro Met Asp Asp Ser Gin Thr Leu Leu Thr Pro. Val 
610 615 620 

60 

Val Ser Cys Gly Pro Pro Gly Ala Leu Leu Thr Arg Pro Val Val Leu 
625 630 635 640 

Thr Met His His Cys Ala Asp Pro Asn Thr Glu Asp Trp Lys He Leu 
65 645 650 655 

. Leu Lys Asn Gin Ala Ala Gin Gly Gin Trp Glu Asp Val Val Val Val 
660 665 670 
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Gly Glu Glu Asn Phe Thr Thr Pro Cys Tyr lie Gin Leu Asp Ala Glu 
675 680 685 

5. Ala Cys His lie Leu Thr Glu Asn Leu Ser Thr Tyr Ala Leu Val Gly 
690 695 700 



10 



His Ser Thr Thr Lys Ala Ala Ala Lys Arg Leu Lys Leu Ala lie Phe 
705 710 715 720 

Gly Pro Leu Cys Cys Ser Ser Leu Glu Tyr Ser lie Arg Val Tyr Cys 
725 730 735 



Leu Asp Asp Thr Gin Asp Ala Leu Lys Glu lie Leu His Leu Glu Arg 
15 740 745 750 

Gin Thr Gly Gly Gin Leu Leu Glu Glu Pro Lys Ala Leu His Phe Lys 
755 760 765 

20 Gly Ser Thr His Asn Leu Arg Leu Ser lie His Asp lie Ala His Ser 
770 775 780 



25 



Leu Trp Lys Ser Lys Leu Leu Ala Lys Tyr Gin Glu lie Pro Phe Tyr 
785 790 795 800 

His Val Trp Ser Gly Ser Gin Arg Asn Leu His Cys Thr Phe Thr Leu 
805 810 815 



Glu Arg Phe Ser Leu Asn Thr Val Glu Leu Val Cys Lys Leu Cys Val 

30 820 825 830 

Arg Gin Val Glu Gly. Glu Gly Gin lie Phe Gin Leu Asn Cys Thr Val 
835 840 845 

35 Ser Glu Glu Pro Thr Gly lie Asp Leu Pro Leu Leu Asp Pro Ala Asn 
850 855 860 



40 



Thr lie Thr Thr Val Thr Gly Pro Ser Ala Phe Ser lie Pro Leu Pro 
865 870 875 880 

lie Arg Gin Lys Leu Cys Ser Ser Leu Asp Ala Pro Gin Thr Arg Gly 
885 890 895 



His Asp Trp Arg Met Leu Ala His Lys Leu Asn Leu Asp Arg Tyr Leu 
45 900 . 905 910 

Asn Tyr Phe Ala Thr Lys Ser Ser Pro Thr Gly Val He Leu Asp Leu 
915 920 925 

50 Trp Glu Ala Gin Asn Phe Pro Asp Gly Asn Leu Ser Met Leu Ala Ala 
930 935 . 940 



55 



Val Leu Glu Glu Met Gly Arg His Glu Thr Val Val Ser Leu Ala Ala 
945 950 955 960 

Glu Gly Gin Tyr 



60 <210> 3 

<211> 2780 

<212> DNA 

<213> Homo sapiens 

65 <400> 3 

ctgaaaacag aagatagaga ggagtctcgg agctcgccat ctccagcgat ctctacattg 60 

ggaaaaaaca tggagtcagc tccggcagcc cccgaccccg ccgccagcga gccaggcagc 120 

agcggcgcgg acgcggccgc cggctccagg gagaccccgc tgaaccagga atccgcccgc 180 
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aagagcgagc cgcctgcccc ggtgcgcaga cagagctatt ccagcaccag cagagatgat 240 
gacttttttc atgaactccc agaaactttt ccttctgatc cacqtgagcc tctgccacat 300 
ttccttattg agcctgaaga agcttatatt gtgaagaata agcctgtgaa cctgtactgt 360 
aaagcaagcc ctgccaccca gatctatttc aagtgtaata gtgaatgggt tcatcagaag 420 
5 gaccacatag tagatgaaag agtagatgaa acttccggtc tcattgtccg ggaagtgagc 480 
attgagattt cgcgccagca agtggaagaa ctctttggac ctgaagatta ctggtgccag 540 
tgtgtggcct ggagctccgc gggtaccaca aagagccgga aggcgtatgt gcgcattgca 600 
tatctacgga agacatttga gcaggaaccc ctaggaaagg aagtgtcttt ggaacaggaa 660 
gtcttactcc agtgtcgacc acctgaaggg atcccagtgg ctgaggtgga atggttgaaa 720 

10 aatgaagaca taattgatcc cgttgaagat cggaattttt atattactat tgatcacaac 780 
ctcatcataa agcaggcccg actctctgat actgcaaatt acacctgtgt tgccaaaaac 840 
attgttgcca agaggaaaag tacaactgcc actgtcatag tctatgtcaa cggtggctgg 900 
tccacctgga cggagtggtc tgtgtgtaac agccgctgtg gacgagggta tcagaaacgt 960 
acaaggactt gtaccaaccc ggcaccactc aatgggggtg . ccttctgtga agggcagagt 1020 

15 gtgcagaaaa tagcctgtac tacgttatgc ccagtggatg gcaggtggac gccatggagc 1080 
aagtggtcta cttgtggaac tgagtgcacc cactggcgca ggagggagtg cacggcgcca 1140 
gcccccaaga atggaggcaa ggactgcgac ggcctcgtct tgcaatccaa gaactgcact 1200 
gatgggcttt gcatgcagag tttcatttat cccatttcaa ctgaacagag aacccagaat 1260 
gaatatggat tttcttctgc tcctgattca gatgatgttg ctctctatgt tgggattgtg 1320 

20 atagcagtga tcgtttgcct ggcgatctct gtagttgtgg ccttgtttgt gtatcggaag 1380 
aatcatcgtg actttgagtc agatattatt gactcttcgg cactcaatgg gggctttcag 1440 
cctgtgaaca tcaaggcagc aagacaagat ctgctggctg tacccccaga cctcacgtca 1500 
gctgcagcca tgtacagagg acctgtctat gccctgcatg acgtctcaga caaaatccca 1560 
atgaccaact ctccaattct ggatccactg cccaacctga aaatcaaagt gtacaacacc 1620 

25 tcaggtgctg tcacccccca agatgacctc tctgagttta cgtccaagct gtcccctcag 1680 
. atgacccagt cgttgttgga gaatgaagcc ctcagcctga agaaccagag tctagcaagg 1740 
cagactgatc catcctgtac cgcatttggc agcttcaact cactgggagg tcaccttatt 1800 
gttcccaatt caggagtcag cttgctgatt cccgctgggg ccattcccca agggagagtc 1860 
tacgaaatgt atgtgactgt acacaggaaa gaaactatga ggccacccat ggatgactct 1920 

30 cagacacttt tgacccctgt ggtgagctgt gggcccccag gagctctgct cacccgcccc 1980 
gtcgtcctca ctatgcatca ctgcgcagac cccaataccg aggactggaa aatactgctc 2040 
ctcgctggag tacagcatcc gagtctactg tctggatgac acccaggatg ccctgaagga 2100 
aattttacat cttgagagac agacgggagg acagctccta gaagaaccta aggctcttca 2160 
ttttaaaggc agcacccaca acctgcgcct gtcaattcac gatatcgccc attccctctg 2220 

35 gaagagcaaa ttgctggcta aatatcagga aattccattt taccatgttt ggagtggatc 2280 
tcaaagaaac ctgcactgca ccttcactct ggaaagattt agcctgaaca cagtggagct 2340 
ggtttgcaaa ctctgtgtgc ggcaggtgga aggagaaggg cagatcttcc agctcaactg 2400 
caccgtgtca gaggaaccta ctggcatcga tttgccgctg ctggatcctg cgaacaccat 2460 
caccacggtc acggggccca gtgctttcag catccctctc cctatccggc agaagctctg 2520 

40 tagcagcctg gatgcccccc agacgagagg ccatgactgg aggatgctgg cccataagct 2580 
gaacctggac aggtacttga attactttgc caccaaatcc agcccaactg gcgtaatcct 2640 
ggatctttgg gaagcacaga acttcccaga tggaaacctg agcatgctgg cagctgtctt 2700 
ggaagaaatg ggaagacatg aaacggtggt gtccttagca gcagaagggc agtattaacc 2760 
accatgctgg aaggggaaat - 2780 

45 

<210> 4 
<211> 669' 
<212> PRT - 
50 <213> Homo sapiens 

<400> 4 

Met Glu Ser Ala Pro Ala Ala Pro Asp Pro Ala Ala Ser Glu Pro Gly 
1 .5 10 15 



55 



Ser Ser Gly Ala Asp Ala Ala Ala Gly Ser Arg Glu Thr Pro Leu Asn 
20 25 30 



Gin Glu Ser Ala Arg Lys Ser Glu Pro Pro Ala Pro Val Arg Arg Gin 
60 35 40 45 

Ser Tyr Ser Ser Thr Ser Arg Asp Asp Asp Phe Phe His Glu Leu Pro 
50 55 60 

65 Glu Thr Phe Pro Ser Asp Pro Pro Glu Pro Leu Pro His Phe Leu lie 
65 70 75 80 



Glu Pro Glu Glu Ala Tyr lie Val Lys Asn Lys Pro Val Asn Leu Tyr 
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85 90 95 

Cys Lys Ala Ser Pro Ala Thr Gin lie Tyr Phe Lys Cys Ash Ser Glu 
100 105 110 

5 

Trp Val His Gin Lys Asp His lie Val Asp Glu Arg Val Asp Glu Thr 
115 120 125 

Ser Gly Leu lie Val Arg Glu Val Ser lie Glu lie Ser Arg Gin Gin 
10. 130 135 . 140 

Val Glu Glu Leu Phe Gly Pro Glu Asp Tyr Trp Cys Gin Cys Val Ala 
145 150 155 160 

15 Trp Ser Ser Ala Gly Thr Thr Lys Ser Arg Lys Ala Tyr Val Arg lie 
165 170 175 

Ala Tyr Leu Arg Lys Thr Phe Glu Gin Glu Pro Leu Gly Lys Glu Val 
180 185 190 

20 

Ser Leu Glu Gin Glu Val Leu Leu Gin Cys Arg Pro Pro Glu Gly lie 
.195 200 205 

Pro Val Ala Glu Val Glu Trp Leu Lys Asn Glu Asp lie lie Asp Pro 
25 210 215 220 

Val Glu Asp Arg Asn Phe Tyr He Thr He Asp His Asn Leu He lie 
225 230 235 240 

30 Lys Gin Ala Arg Leu Ser Asp Thr Ala Asn Tyr Thr Cys Val Ala Lys 
245 250 255 

Asn He Val Ala Lys Arg Lys Ser Thr Thr Ala Thr Val He Val Tyr 
260 265 270 

35 

Val Asn Gly Gly Trp Ser Thr Trp Thr Glu Trp Ser Val Cys Asn Ser 
275 280 285 

Arg Cys Gly Arg Gly Tyr Gin Lys Arg Thr Arg Thr Cys Thr Asn Pro 
40 290 "* 295 300 

Ala Pro Leu Asn Gly Gly Ala Phe Cys Glu Gly Gin Ser Val Gin Lys 
305 310 315 320 

45 He Ala Cys Thr Thr Leu Cys Pro yal Asp Gly Arg Trp Thr Pro Trp 
325 330 335 

Ser Lys Trp Ser Thr Cys Gly Thr Glu Cys Thr His Trp Arg Arg Arg 
340 345 350 

50 

Glu Cys Thr Ala Pro Ala Pro Lys Asn Gly Gly Lys Asp Cys Asp Gly 
355 360 365 

Leu Val Leu Gin Ser Lys Asn Cys Thr Asp Gly Leu Cys Met Gin Ser 
55 370 375 380 

Phe He Tyr Pro He Ser Thr Glu Gin Arg Thr Gin Asn Glu Tyr Gly 
385 " 390 395 400 

60 Phe Ser Ser Ala Pro Asp Ser Asp Asp Val Ala Leu Tyr Val Gly He 
405 410 415 

Val He Ala Val He Val Cys Leu Ala He Ser Val Val Val Ala Leu 
420 425 430 

65 

Phe Val Tyr Arg Lys Asn His Arg Asp Phe Glu Ser Asp He He Asp 
435 440 445 
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Ser Ser Ala Leu Asn Gly Gly Phe Gin Pro Val Asn lie Lys Ala Ala 
450 455 460 

Arg Gin Asp Leu Leu Ala Val Pro Pro Asp Leu Thr Ser Ala Ala Ala 
.5 465 470 475 " 480 

Met Tyr Arg Gly Pro Val Tyr Ala Leu His Asp Val Ser Asp Lys lie 
485 490 495 

10 Pro Met Thr Asn Ser Pro lie Leu Asp Pro Leu Pro Asn Leu Lys lie 
500 .. • 505 510 

Lys Val Tyr Asn Thr Ser Gly Ala Val Thr Pro Gin Asp Asp Leu Ser 
515 520 525 

15 

Glu Phe Thr Ser Lys Leu Ser Pro Gin Met Thr Gin Ser Leu Leu Glu 
530 535 540 

Asn Glu Ala Leu Ser Leu Lys Asn Gin Ser Leu Ala Arg Gin Thr Asp 
20 545 . 550 555 560 

Pro Ser Cys Thr Ala Phe Gly Ser Phe Asn Ser Leu Gly Gly His Leu 
565 570 575 

25 lie Val Pro Asn Ser Gly Val Ser Leu Leu lie Pro Ala Gly Ala lie 
580 585 590 

Pro Gin Gly Arg Val Tyr Glu Met Tyr Val Thr Val His Arg Lys Glu 
595 600 605 

30 

Thr Met Arg Pro Pro Met Asp Asp Ser Gin Thr Leu Leu Thr Pro Val 
610 615 620 



Val Ser Cys Gly Pro Pro Gly Ala Leu Leu Thr Arg Pro Val Val Leu 
.35 625 630 635 640 

Thr Met His His Cys Ala Asp Pro Asn Thr Glu Asp Trp Lys lie Leu 
.645 650 655 

40 Leu Leu Ala Gly Val Gin His Pro Ser Leu Leu Ser Gly 
660 665 



<210> 5 
45 <211> 1863 
<212> DNA 
<213> Homo sapiens 



<400> 5 

50 gttttttttt tttttttggt accatagagt 
cggagctcgc atctccagcg atctctacat 
cccccgaccc cgccgccagc gagccaggca 
gggagacccc gctgaaccag gaatccgccc 
gacagagcta ttccagcacc agcagaggta 

55 aaattgaaat tattccatgc aagatctgtg 
* tcattacatg tgaaggctgc aagggctttt 
actcctgtcc tcgtcagaag aactgtttga 
actgtcgatt acagaaatgc cttgccgtag 
gaatgtcaaa aaagcagaga gacagcttgt 

60 agcagcagcg cgaccaccag cagcagcctg 
acatctcggc caacgggctg acggaacttc 
acacccctga ggggagtaag gcagactccg 
cttccccaga ccagtcaggt cttgatatca 
acacaccagc atcaggcttc tttccctact 

65 ctgtgtccat ggcagaatta gaacaccttg 
cctgccaata cttgagagaa gagctccagc 
aaattgagaa ctatcaaaac aagcagcggg 
ttacagaagc tatacagtat gtggtggagt 



tgctctgaaa acagaagata gagggagtct 60 
tgggaaaaaa catggagtca gctccggcag 120 
gcagcggcgc ggacgcggcc gccggctcca 180 
gcaagagcga gccgcctgcc ccggtgcgca 240 
tctcagtaac gaagaagaca catacatctc 300 
gagacaaatc atcaggaatc cattatggtg 360 
tcaggagaag tcagcaaagc aatgccacct 420 
ttgatcgaac cagtagaaac cgctgccaac 480 
ggatgtctcg agatgctgta aaatttggcc 540 
atgcagaagt acagaaacac cggatgcagc 600 
gagaggctga gccgctgacg cccacctaca 660 
acgacgacct cagtaactac attgacgggc 720 
ccgtcagcag cttctacctg gacatacagc 780 
atggaatcaa accagaacca atatgtgact 840 
gttcgttcac caacggcgag acttccccaa 900 
cacagaatat atctaaatcg catctggaaa 960 
agataacgtg gcagaccttt ttacaggaag 1020 
aggtgatgtg gcaattgtgt gccatcaaaa 1080 
ttgccaaacg cattgatgga tttatggaac 1140 



WO 2004/083371 



PCT/GB2004/001124 



" 8 - 

tgtgtcaaaa tgatcaaatt gtgcttctaa aagcaggttc tctagaggtg gtgtttatca 1200 

gaatgtgccg tgcctttgac tctcagaaca acaccgtgta ctttgatggg aagtatgcca 1260 

gccccgacgt cttcaaatcc ttaggttgtg aagactttat tagctttgtg tttgaatttg 1320 

gaaagagttt atgttctatg cacctgactg aagatgaaat tgcattattt tctgcatttg 1380 

5 tactgatgtc agcagatcgc tcatggctgc aagaaaaggt aaaaattgaa aaactgcaac 1440 

agaaaattca gctagctctt caacacgtcc tacagaagaa tcaccgagaa gatggaatac 1500 

taacaaagtt aatatgcaag gtgtctacat- taagagcctt atgtggacga catacagaaa 1560 

agctaatggc atttaaagca atatacccag acattgtgcg acttcatttt cctccattat 1620 

acaaggagtt gttcacttca gaatttgagc cagcaatgca aattgatggg taaatgttat 1680 

10 cacctaagca cttctagaat gtctgaagta caaacatgaa aaacaaacaa aaaaattaac 1740 

cgagacactt tatatggccc tgcacagacc tggagcgcca cacactgcac atcttttggt 1800 

gatcggggtc aggcaaagga ggggaaacaa tgaaaacaaa taaagttgaa cttgtttttc 1860 

• tea 1863 

15 

<210> 6 
<211> 523 
<212> PRT 

<213> Homo sapiens 

20 

<400> 6 

Met Glu Ser Ala Pro Ala Ala Pro Asp Pro Ala Ala Ser Glu Pro Gly 

1 . . 5 10 15 

25 Ser Ser Gly Ala Asp Ala Ala Ala Gly Ser Arg Glu Thr Pro Leu Asn 
20 25 30 

Gin Glu Ser Ala Arg Lys Ser Glu Pro Pro Ala Pro Val Arg Arg Gin 
35 .40 45 

30 

Ser Tyr Ser Ser Thr Ser Arg Gly lie Ser Val Thr Lys Lys Thr His 
50 55 60 

Thr Ser Gin lie Glu lie He Pro Cys Lys He Cys Gly Asp Lys Ser 
35 65 70 75 80 

Ser Gly He His Tyr Gly Val He Thr Cys Glu Gly Cys Lys Gly Phe 
85 90 95 

40 Phe Arg Arg Ser Gin Gin Ser Asn Ala Thr Tyr Ser Cys Pro Arg Gin 
100 105 110 

Lys Asn Cys Leu He Asp Arg Thr Ser Arg Asn Arg Cys Gin His. Cys 
115 120 125 

45 

Arg Leu Gin Lys Cys Leu Ala Val Gly Met Ser Arg Asp Ala Val Lys 
130 135 140 

Phe Gly Arg Met Ser Lys Lys Gin Arg Asp Ser Leu Tyr Ala Glu Val 
50 145 150 155 160 

Gin Lys His Arg Met Gin Gin Gin Gin Arg Asp His Gin Gin Gin Pro 
165 170 175 

55 Gly Glu Ala Glu Pro Leu Thr Pro Thr Tyr Asn He Ser Ala Asn Gly 
180 185 190 

Leu Thr Glu Leu His Asp Asp Leu Ser Asn Tyr lie Asp Gly His Thr 
195 200 205 

60 

Pro Glu Gly Ser Lys Ala Asp Ser Ala Val Ser Ser Phe Tyr Leu Asp 
210 215 220 

He Gin Pro Ser Pro Asp Gin Ser Gly Leu Asp He Asn Gly He Lys 
65 225 230 235 240 

Pro Glu Pro He Cys Asp Tyr Thr Pro Ala Ser Gly Phe Phe Pro Tyr 
245 250 255 
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Cys Ser Phe Thr Asn Gly Glu Thr Ser Pro Thr Val Ser Met Ala Glu 
260 265 270 

. 5 Leu Glu His Leu Ala Gin Asn lie Ser Lys Ser His Leu Glu Thr Cys 
275 280 285 

Gin Tyr Leu Arg Glu Glu Leu Gin Gin lie Thr Trp Gin Thr Phe Leu 
290 . 295 300 

10 

Gin Glu Glu lie Glu Asn Tyr Gin Asn Lys Gin Arg Glu Val Met Trp 
. 305 310 . 315 320 

Gin Leu Cys Ala He Lys He Thr Glu Ala He Gin Tyr Val Val Glu 
15 325 « 330 335 

Phe Ala Lys Arg He Asp Gly Phe Met Glu Leu Cys Gin Asn Asp Gin 
340 345 350 

20 He Val Leu Leu Lys Ala Gly Ser Leu Glu Val Val Phe He Arg Met 
355 360 365 

Cys Arg Ala Phe Asp Ser Gin Asn Asn Thr Val Tyr Phe Asp Gly Lys 
370 375 380 

25 

Tyr Ala Ser Pro Asp Val Phe Lys Ser Leu Gly Cys Glu Asp Phe lie 
385 390 395 400 

Ser Phe Val Phe Glu Phe Gly Lys Ser Leu Cys Ser Met His Leu Thr 
30 405 410 415 

Glu Asp Glu He Ala Leu Phe Ser Ala Phe Val Leu Met Ser Ala Asp 
420 425 430 

35 Arg Ser Trp Leu Gin Glu Lys Val Lys He Glu Lys Leu Gin Gin Lys 
435 440 445 

He Gin Leu Ala Leu Gin His Val Leu Gin Lys Asn His Arg Glu Asp 
450 455 460 

40 

Gly He Leu Thr Lys Leu lie Cys Lys Val Ser Thr Leu Arg Ala Leu 
. 465 470 475 480 

Cys Gly Arg His Thr Glu Lys Leu Met Ala Phe Lys Ala lie Tyr Pro 
45 485 490 495 

Asp He Val Arg Leu His Phe Pro Pro Leu Tyr Lys Glu Leu Phe Thr 
500 505 510 

50 Ser Glu . Phe Glu Pro Ala Met Gin He Asp Gly , 
515 520 



<210> 7 
55 <211> 1821 
<212> DMA 
<213> Homo sapiens 

<400> 7 

60 ctagattcct ctgtgtttca tacccaggag gtcagggagg gcaccttgta gtgcctgagg- 60 

ccccgagagg gttcacagcc acctgagaag tgtttgcagt ctatgggtgt gtgttctttc 120 

tccaagctgt gtcagctgtg ggatggcgtg gagagaaaac aagaaaaagt tgtttccctc 180 

agctctctgc tgaaaaaatg ctcacgggga gtcagtcaac atttggtatc tcagtaacga 240 

agaagacaca tacatctcaa attgaaatta ttccatgcaa gatctgtgga gacaaatcat 300 

65 caggaatcca ttatggtgtc attacatgtg aaggctgcaa gggctttttc aggagaagtc 3 60 

agcaaagcaa tgccacctac tcctgtcctc gtcagaagaa ctgtttgatt gatcgaacca 420 

gtagaaaccg ctgccaacac tgtcgattac agaaatgcct tgccgtaggg atgtctcgag 480 

atgctgtaaa atttggccga atgtcaaaaa agcagagaga cagcttgtat gcagaagtac 540 
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agaaacaccg gatgcagcag cagcagcgcg accaccagca gcagcctgga gaggctgagc 600 
. cgctgacgcc cacctacaac atctcggcca acgggctgac ggaacttcac gacgacctca 660 
gtaactacat tgacgggcac acccctgagg ggagtaaggc agactccgcc gtcagcagct 720 
tctacctgga catacagcct tccccagacc agtcaggtct tgatatcaat ggaatcaaac 780 
5 . cagaaccaat atgtgactac acaccagcat caggcttctt tccctactgt tcgttcacca 840 
acggcgagac ttccccaact gtgtccatgg cagaattaga acaccttgca cagaatatat 900 
ctaaatcgca tctggaaacc tgccaatact tgagagaaga gctccagcag ataacgtggc 960 
agaccttttt acaggaagaa attgagaact atcaaaacaa gcagcgggag gtgatgtggc 1020 
aattgtgtgc catcaaaatt acagaagcta tacagtatgt ggtggagttt gccaaacgca 1080 

10 ttgatggatt tatggaactg tgtcaaaatg atcaaattgt gcttctaaaa gcaggttctc 1140 
tagaggtggt gtttatcaga atgtgccgtg cctttgactc tcagaacaac accgtgtact 1200 
ttgatgggaa gtatgccagc cccgacgtct tcaaatcctt aggttgtgaa gactttatta 1260 
gctttgtgtt tgaatttgga aagagtttat gttctatgca cctgactgaa gatgaaattg 1320 
cattattttc tgcatttgta ctgatgtcag cagatcgctc atggctgcaa gaaaaggtaa 1380 

15 aaattgaaaa actgcaacag aaaattcagc tagctcttca acacgtccta cagaagaatc 1440 
accgagaaga tggaatacta acaaagttaa tatgcaaggt gtctacatta agagccttat 1500 
gtggacgaca tacagaaaag ctaatggcat ttaaagcaat atacccagac attgtgcgac 1560 
. ttcattttcc tccattatac aaggagttgt tcacttcaga atttgagcca gcaatgcaaa 1620 
ttgatgggta aatgttatca cctaagcact tctagaatgt ctgaagtaca aacatgaaaa 1680 

20 acaaacaaaa aaattaaccg agacacttta tatggccctg cacagacctg gagcgccaca 1740 
cactgcacat cttttggtga tcggggtcag gcaaaggagg ggaaacaatg aaaacaaata 1800 
. aagttgaact tgtttttctc a 1821 

25 <210> 8 
<211> 477 
<212> PRT 

<213> Homo sapiens 
30 <400> 8 

Met Leu Thr Gly Ser Gin Ser Thr Phe Gly He Ser Val Thr Lys Lys 
1 5 10 15 

Thr His Thr Ser Gin He Glu He He Pro Cys Lys He Cys Gly Asp 
35 20 25 30 

Lys Ser Ser Gly He His Tyr Gly Val He Thr Cys Glu Gly Cys Lys 
35 40 45 

40 Gly Phe Phe Arg Arg Ser Gin Gin Ser Asn Ala Thr Tyr Ser Cys Pro 
50 55 60 



45 



Arg Gin. Lys Asn Cys Leu He Asp Arg Thr Ser Arg Asn Arg Cys Gin 
65 70" 75 80 

His Cys Arg Leu Gin Lys Cys Leu Ala Val Gly Met Ser Arg Asp Ala 
85 90 95 



Val Lys Phe Gly Arg Met Ser Lys Lys Gin Arg Asp Ser Leu Tyr Ala 
50 100 105 • 110 

Glu Val Gin Lys His Arg Met Gin Gin Gin Gin Arg Asp His Gin Gin 
115 120 125 

55 Gin Pro Gly Glu Ala Glu Pro Leu Thr Pro Thr Tyr Asn He Ser Ala 
130 135 140 

Asn Gly Leu Thr Glu Leu His Asp Asp Leu Ser Asn Tyr He Asp Gly 
145 150 155 160 

60 

His Thr Pro Glu Gly Ser Lys Ala Asp Ser Ala Val Ser Ser Phe Tyr 
165 170 175 

Leu Asp He Gin Pro Ser Pro Asp Gin Ser Gly Leu Asp He Asn Gly 
65 180 185 190 

He Lys Pro Glu Pro He Cys Asp Tyr Thr Pro Ala Ser Gly Phe Phe 
195 200 205 
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Pro Tyr Cys Ser Phe Thr Asn Gly Glu Thr Ser Pro Thr Val Ser Met 
210 215 220 

5 Ala Glu Leu Glu His Leu Ala Gin Asn lie Ser Lys Ser His Leu Glu . 
225 230 235 240 

Thr Cys Gin Tyr Leu Arg Glu Glu Leu Gin Gin lie Thr Trp Gin Thr 
245 250 255 

10 

Phe Leu Gin Glu Glu lie Glu Asn Tyr Gin Asn Lys Gin Arg Glu Val 
260 265 270 

Met Trp Gin Leu Cys Ala lie Lys He Thr Glu Ala He Gin Tyr Val 
15 275 280 285 

Val Glu Phe Ala Lys Arg He Asp Gly Phe Met Glu Leu Cys Gin Asn 
290 295 300 

20 Asp Gin He Val Leu Leu Lys Ala Gly Ser Leu Glu Val Val Phe He 
305 310 315 320 

Arg Met Cys Arg Ala Phe Asp Ser Gin Asn Asn Thr Val Tyr Phe Asp 
325 330 335 

25 

Gly Lys Tyr Ala Ser Pro Asp Val Phe Lys Ser Leu Gly Cys Glu Asp 
340 345 350 

Phe He Ser Phe Val Phe Glu Phe Gly Lys Ser Leu Cys Ser Met His 
30 355 360 365 

Leu Thr Glu Asp Glu He Ala Leu Phe Ser Ala Phe Val Leu Met Ser 
370 375 380 

35 Ala Asp Arg Ser Trp Leu Gin Glu Lys Val Lys He Glu Lys Leu Gin 
385 390 395 400 

Gin Lys He Gin Leu Ala Leu Gin His Val Leu Gin Lys Asn His Arg 
405 410 415 

40 

Glu Asp Gly He Leu Thr Lys Leu He Cys Lys Val Ser Thr Leu Arg 
420 425 430 

Ala Leu Cys Gly Arg His Thr Glu Lys Leu Met Ala Phe Lys Ala He 
45 435 440 445 

Tyr Pro Asp He Val Arg Leu His Phe Pro Pro Leu Tyr Lys Glu Leu 
450 455 460 

50 Phe Thr Ser Glu Phe Glu Pro Ala Met Gin He Asp Gly 
465 470 475 



<210> 9 
55 <211> 3646 
<212> DNA 
<213> Homo sapiens 

<400> 9 

60 ctgcctttgg agaaagtgga gtgtggcgct tggttgtcgt tatttcttcg gactgcttcg 60 
cggtgcacgg attcagcttc tgcccagtgg ggctttcagc tgtttgcgcg tctctctgtc 120 
cccctcccct ccccccggca cacctctgtc tacgatgagg aaaggtctgc gggcgacagc 180 
ggcccgctgc ggactgggac tgggatactt gctgcaaatg ctcgtgctac ctgccctggc 240 
cctgctcagc gccagcggca ctggctccgc cgcccaagat gatgactttt ttcatgaact 300 

65 cccagaaact tttccttctg atccacctga gcctctgcca catttcctta ttgagcctga 360 
agaagcttat attgtgaaga ataagcctgt gaacctgtac tgtaaagcaa gccctgccac 420 
ccagatctat ttcaagtgta atagtgaatg ggttcatcag aaggaccaca tagtagatga 480 
aagagtagat gaaacttccg gtctcattgt ccgggaagtg agcattgaga tttcgcgcca 540 
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gcaagtggaa gaactctttg gacctgaaga ttactggtgc cagtgtgtgg cctggagctc 600 
cgcgggtacc acaaagagcc ggaaggcgta tgtgcgcatt gcatatctac ggaagacatt 660 
tgagcaggaa cccctaggaa aggaag£gtc tttggaacag gaagtcttac tccagtgtcg 720 
accacctgaa gggatcccag tggctgaggt ggaatggttg aaaaatgaag acataattga 780 
5 tcccgttgaa gatcggaatt tttatattac tattgatcac aacctcatca taaagcaggc 840 
ccgactctct gatactgcaa attacacctg tgttgccaaa aacattgttg ccaagaggaa 900 
aagtacaact gccactgtca tagtctatgt caacggtggc tggtccacct ggacggagtg 960 
gtctgtgtgt aacagccgct gtggacgagg gtatcagaaa cgtacaagga cttgtaccaa 1020 
cccggcacca ctcaatgggg gtgccttctg tgaagggcag agtgtgcaga aaatagcctg 1080 

10 tactacgtta tgcccagtgg atggcaggtg gacgccatgg agcaagtggt ctacttgtgg 1140 
aactgagtgc acccactggc gcaggaggga gtgcacggcg ccagccccca agaatggagg 1200 
caaggactgc gacggcctcg tcttgcaatc caagaactgc actgatgggc tttgcatgca 1260 
gactgctcct gattcagatg atgttgctct ctatgttggg attgtgatag cagtgatcgt 1320 
ttgcctggcg atctctgtag ttgtggcctt gtttgtgtat cggaagaatc atcgtgactt 1380 

15 tgagtcagat attattgact cttcggcact caatgggggc tttcarcctg tgaacatcaa 1440 
ggcagcaaga caagatctgc tggctgtacc cccagacctc acgtcagctg cagccatgta 1500 
cagaggacct gtctatgccc tgcatgacgt ctcagacaaa atcccaatga ccaactctcc 1560 
aattctggat ccactgccca acctgaaaat caaagtgtac aacacctcag gtgctgtctc 1620 
.cccccaagat gacctctctg agtttacgtc caagctgtcc cctcagatgra cccagtcgtt 1680 

20 gttggagaat gaagccctca gcctgaagaa ccagagtcta gcaaggcaga ctgatccatc 1740 
ctgtaccgca tttggcagct tcaactcgct gggaggtcac cttattgttc ccaattcagg 1800 
agtcagcttg ctgattcccg ctggggccat tccccaaggg agagtctacg aaatgtatgt 1860 
gactgtacac aggaaagaaa ctatgaggcc acccatggat gactctcaga cacttttgac 1920 
ccctgtggtg agctgtgggc ccccaggagc tctgctcacc cgccccgtcg tcctcactat 1980 

25 gcatcactgc gcagacccca ataccgagga ctggaaaata ctgctcaaga accaggcagc 2040 
acagggacag tgggaggatg tggtggtggt cggggaggaa aacttcacca ccccctgcta 2100 
cattaagctg gatgcagagg cctgccacat cctcacagag aacctcagca cctacgccct 2160 
ggtaggacat tccaccacca aagcggctgc aaagcgcctc. aagctggcca tctttgggcc 2220 
cctgtgctgc tcctcgctgg agtacagcat ccgagtctac tgtctggatg acacccagga 2280 

30 tgccctgaag gaaattttac atcttgagag acagacggga ggacagctcc tagaagaacc 2340 
taaggctctt cattttaaag gcagcaccca caacctgcgc ctgtcaattc acgatatcgc 2400 
ccattccctc tggaagagca aattgctggc taaatatcag gaaattccat tttaccatgt 2460 
ttggagtgga tctcaaagaa acctgcactg caccttcact ctggaaagat ttagcctgaa 2520 
cacagtggag ctggtttgca aactctgtgt gcggcaggtg gaaggagaag ggcagatctt 2580 

35 ccagctcaac tgcaccgtgt cagaggaacc tactggcatc gatttgccgc tgctggatcc 2640 
tgcgaacacc atcaccacgg tcacggggcc cagtgctttc agcatccctc tccctatccg 2700 
gcagaagctc tgtagcagcc tggatgcccc ccagacgaga ggccatgact ggaggatgct 2760 
ggcccataag ctgaacctgg acaggtactt gaattacttt gccaccaaat ccagcccaac 2820 
tggcgtaatc ctggatcttt gggaagcaca gaacttccca gatggaaacc tgagcatgct 2880 

40 ggcagctgtc ttggaagaaa tgggaagaca tgaaacggtg gtgtccttag cagcagaagg 2940 
gcagtattaa ccaccatgct ggaaggggaa atgaaggaca aaaatgcaca gggagtctgt 3000 
ggccgtccag gtgaatcaca gctgaggagg aaatccagat gagaccaatg cacttcacag 3060 
gcaagaatgc agcaggagcc agaaggaaaa cagatacaac tgcccatgta catgcccact 3120 
ttactcggag atcatcacgg gagttaagaa aaattgtgta aatttgtacc ttgaatttag 3180 

45 ctatcaacct aattttcctc ttagttgggc tgtatgctgt gtggtacagg atcttacagt 3240 
ttcctaggaa acgcttttta ttgctatcca gatatatgga taaactttct taacaaaccc 3300 
aatttctaca aatgttgttt acatcaaatt ggacagggat gcagacactg tccatggctc 3360 
gttctatttt tgttcaaatc atttgaagtt gaagctgtgg acggtttgtt gtgtctattt 3420 
cagattagta atttacagag aaatcacaga cttttgctac aaatcgtgtg catcaagtgt 3480 

50 ctcagataat cctcccatca gtgttctgtt tctagaactt gtagaaccag tgttactgtt 3540 
tgtatcaggg aagtggagaa tctaagtgta aaaaagaaat aactaagact cctattcctt 3600 
ggagggaccc ttctggtgcc ctttgggaat aaagctgtag cactgc 3646 

55 <210> 10 
<211> 931 
<212> PRT 

<213> Homo sapiens 
60 <400> 10 

Met Arg Lys Gly Leu Arg Ala Thr Ala Ala Arg Cys Gly Leu Gly Leu 
1 5 . 10 15 

Gly Tyr Leu Leu Gin Met Leu Val Leu Pro Ala Leu Ala Leu Leu Ser 
65 20 . 25 30 

Ala Ser Gly Thr Gly Ser Ala Ala Gin Asp Asp Asp Phe Phe His Glu 
35 .40 45 
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Leu Pro Glu Thr Phe Pro Ser Asp Pro Pro Glu Pro Leu Pro His Phe 
50 . 55 60 

5 Leu lie Glu Pro Glu Glu Ala Tyr He Val Lys Asn Lys Pro Val Asn 
65 .70 . 75 80 

Leu Tyr Cys Lys Ala Ser Pro Ala Thr Gin He. Tyr Phe Lys Cys Asn 
85 .90 95 

10 

Ser Glu Trp' Val His Gin Lys Asp His He Val Asp Glu Arg Val Asp 
100 105 110 ■ 

Glu Thr Ser Gly Leu He Val Arg Glu Val Ser He Glu He Ser Arg 
15 115 120 125 

Gin Gin Val Glu Glu Leu Phe Gly Pro Glu Asp Tyr Trp Cys Gin Cys , 
130 135 140 

20 Val Ala Trp Ser Ser Ala Gly Thr Thr Lys Ser Arg Lys Ala Tyr Val 
145 ^ 150 155 . 160 

Arg He Ala Tyr Leu Arg Lys Thr Phe Glu Gin Glu Pro Leu Gly Lys 
165 - 170 175 

25 

Glu Val Ser Leu Glu Gin Glu Val Leu Leu Gin Cys Arg Pro Pro Glu 
180 185 190 

Gly He Pro Val Ala Glu Val Glu Trp Leu Lys Asn Glu Asp He He 
30 195 200 205 

Asp Pro Val Glu Asp Arg Asn Phe Tyr He Thr He Asp His Asn Leu 
210 215 220 

35 He He Lys Gin Ala Arg Leu Ser Asp Thr Ala Asn Tyr Thr Cys Val 
225 230 235 240 

Ala Lys Asn He Val Ala Lys Arg Lys Ser Thr Thr Ala Thr Val He 
245 250 255 

40 

Val Tyr Val Asn Gly Gly Trp Ser Thr Trp Thr Glu Trp Ser Val Cys 
260 . 265 270 

Asn Ser Arg Cys Gly Arg Gly Tyr Gin Lys Arg Thr Arg Thr Cys Thr 
45 275 280 285 

Asn Pro Ala Pro Leu Asn Gly Gly Ala Phe Cys Glu Gly Gin Ser Val 
290 295 300 

50 Gin Lys He Ala Cys Thr Thr Leu Cys Pro Val Asp Gly Arg Trp Thr 
305 310 315 • 320 

Pro Trp Ser Lys Trp Ser Thr Cys Gly Thr Glu Cys Thr His Trp Arg 
325 330 335 

55 

Arg Arg Glu Cys Thr Ala Pro Ala Pro Lys Asn Gly Gly Lys Asp Cys 
340 345 350 

Asp Gly Leu Val . Leu Gin Ser Lys Asn Cys Thr Asp Gly Leu Cys Met 
60 355 360 365 

Gin Thr Ala Pro Asp Ser Asp Asp Val Ala Leu Tyr Val Gly He Val 
370 375 380 

65 He Ala Val He Val Cys Leu Ala He Ser Val Val Val Ala Leu Phe 
385 . 390 395 400 

Val Tyr Arg Lys Asn His Arg Asp Phe Glu Ser Asp He He Asp Ser 
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405 410 415 

Ser Ala Leu Asn Gly Gly Phe Gin Pro Val Asn He Lys Ala Ala Arg 
420 425 430 

5 

Gin Asp Leu Leu Ala Val Pro Pro Asp Leu Thr Ser Ala Ala Ala Met 
435 440 445 

Tyr Arg Gly Pro Val Tyr Ala Leu His Asp Val Ser Asp Lys He Pro 
10 450 ' 455 460 

Met Thr Asn Ser Pro He Leu Asp Pro Leu Pro Asn Leu Lys lie Lys 
465 470 475 480 ! 

15 Val Tyr Asn Thr Ser Gly Ala Val Ser Pro Gin Asp Asp Leu Ser Glu 
485 490 495 

Phe Thr Ser Lys Leu Ser Pro Gin Met Thr Gin Ser Leu Leu Glu Asn 
500 505 510 

20 

Glu Ala Leu Ser Leu Lys Asn Gin Ser Leu Ala Arg Gin Thr Asp Pro 
515 520 . 525 

Ser Cys Thr Ala Phe Gly Ser Phe Asn Ser Leu Gly Gly His Leu He 
25 530 535 540 

Val Pro Asn Ser Gly Val Ser Leu Leu He Pro Ala Gly Ala He Pro 
545 550 555 560 

30 Gin Gly Arg Val Tyr Glu Met Tyr Val Thr Val His Arg Lys Glu Thr 
565 570 575 

Met Arg Pro Pro Met Asp Asp Ser Gin Thr Leu Leu Thr Pro Val Val 
580 585 590 

35 

Ser Cys Gly Pro Pro Gly Ala Leu Leu Thr Arg Pro Val Val Leu Thr 
595 600 605 

Met His His Cys Ala Asp Pro Asn Thr Glu Asp Trp Lys He Leu Leu 
40 610 615 620 

Lys Asn Gin Ala Ala Gin Gly Gin Trp Glu Asp Val Val Val Val Gly 
625 630 635 640 

45 Glu Glu Asn Phe Thr Thr Pro Cys Tyr He Lys Leu Asp Ala Glu Ala 
645 650 655 

Cys His He Leu Thr Glu Asn Leu Ser Thr Tyr Ala Leu Val Gly His 
660 665 670 

50 

Ser Thr Thr Lys Ala Ala Ala Lys Arg Leu Lys Leu Ala He Phe Gly 
675 680 685 

Pro Leu Cys Cys Ser Ser Leu Glu Tyr Ser He Arg Val Tyr Cys Leu 
55 690. 695 700 

Asp Asp Thr Gin Asp Ala Leu Lys Glu He Leu His Leu Glu Arg Gin 
705 710 715 720 

60 Thr Gly Gly Gin Leu Leu Glu Glu Pro Lys Ala Leu His Phe Lys Gly 
725 730 735 

Ser Thr His Asn Leu Arg Leu Ser He His Asp He Ala His Ser Leu 
.740 745 750 

65 

Trp Lys Ser Lys Leu Leu Ala Lys Tyr Gin Glu He Pro Phe Tyr His 
755 760 765 
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Val Trp Ser Gly Ser Gin Arg Asn Leu His Cys Thr Phe Thr Leu Glu 
770 775 780 

Arg Phe Ser Leu Asn Thr Val Glu Leu Val Cys Lys Leu Cys Val Arg 
5 785 790 795 800 . 

Gin Val Glu Gly Glu Gly Gin He Phe Gin Leu Asn Cys Thr Val Ser 
805 810 815 

10 Glu Glu Pro Thr Gly He Asp Leu Pro Leu Leu Asp Pro Ala Asn Thr 
820 ■ 825 830 



15 



lie Thr Thr Val Thr Gly Pro Ser Ala Phe Ser lie Pro Leu Pro He 

835 840 845 

Arg Gin Lys Leu Cys Ser Ser Leu Asp Ala Pro Gin Thr Arg Gly His 
850 * 855 860 



Asp Trp Arg Met Leu Ala His Lys Leu Asn Leu Asp Arg Tyr Leu Asn 
20. 865 870 875 880 

Tyr Phe Ala Thr Lys Ser Ser Pro Thr' Gly Val He Leu Asp Leu Trp 
885. 890 895 

25 Glu Ala Gin Asn Phe Pro Asp Gly Asn Leu Ser Met Leu Ala Ala Val 
900 905 910 



30 



Leu Glu Glu Met Gly Arg His Glu Thr Val Val Ser Leu Ala Ala Glu 
915 920 925 

Gly Gin Tyr 
930 



35 <2io> H 

<211> 6277 

<212> DNA 

<213> Homo sapiens 

40 <400> 11 

ataaagcaca tttagcattg ttgtgtttta 
agcatataat taagattttt tgaataaact 
tatcattacc ctaatgccca tttttggttg 
caaggatgtc catggttttg gaatttaaga 

45 aattaaagta gcagatggat ttaaagttgc 
tcttggatta ttccattggg tccaatatta 
agaagaagtg agtgctttga tacgagaagt 
gggggaaggg gccactagcc aaagaatgta 
aaacatgctt tctgtagagg ctgcagaaag 

50 ttaaggctgc cctctggtcc tagccttggc 
acaatttatt ttttaaaaag tcatcaagag 
tcttatgtaa tctaatcaag gaaatggaat 
ctcaagagaa aaacatatac atggcaaata 
ttggaattct gtctacaaca ctcctggtat 

55 cacatatata tacgtacata cacatccatg 
gtagacatat . cagcttctta taatatttga 
tttcgctgtt ttagctacca gatcccatct 
gaaagtttct ctatgcatct tttctctcgt 
aacagcaaaa ttattgactg acacttcaaa 

60 aatatttttc tttatgaggg tgaagtagaa 
gaaaaaactg taacaaaaaa atcttcacac 
agtgattaat taaattccaa tgctcctctg 
aatatcgccg ggcatggtgg ctcacgtctg 
gcagatcacg aggtcaggag atcgagacca 

65 actaaaaata caaaaaaaaa aaaagttagg 
gctactcggg aggctgaggc agaagaatgg 
gccaagatcg tgccactgca ctccagcctg 
aaaaaaaaaa ttataactaa agttttgaat 



ataactgcaa atgttctaaa tgtgaataag 60 
cctaaaattg aaatgtttgt gttgtcttta 120 
aggtatttga cagccagaat aatggctccc 180. 
atatgttaca ttacatggca aatctgtaga 240 
taatctgctg actttaagag aggtagattt 300 
tcacaataat ttctaaatgt ggaggaaatt 3 60 
acttgaacca tacttgctgg gtgtgaagat 420 
ggtgtcctct ggaagctgga aacagctaga 480 
gagctcagct ctgggatcat tctcagttcc 540 
tagcctaggc cttggccttc tcatagaatc 600 
aattaatctc tcaagcttgc catgatcatg 660 
atcccctcat atccataagc cctggccaca 720 
tagcagagaa caagaaattt ggaggccatc 780 
tccctctgtg tgtttgtctt tgtctctctc 840 
cacatatgta tttgtattac ttttttcaat 900 
tttctttgtt tttccactag ttcagtcata 960 
gatacatgaa aacaacatga agaagataaa 1020 
gacattcaat tgcatactca ccatcaccca 1080 
taatctcttc taatacttat ttacattaaa 1140 
attaattcac agctctgtac tctggacaaa 1200 
tgatctaaca ctttatattc tcatgagtca 1260 
aaggaggttg ggcatttcta taaattaaaa 1320 
taatcccagc actttgggaa gccaaggcag 1380 
tcctggctta cacggtgaaa ccctgtctct 1440 
tgggcgtagt ggcgggcgcc tgtagtccca 1500 
cgtgaaccca ggaggcggag cttgcagtga 1560 
ggtgacagag cgagactcca tctcaaaaaa 1620 
taaatcattc tgattgtcca ggaatgcaca 1680 
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cccttaaata tatgtaataa aaaattaaag atttcttata gaaaaactta ttgaaaaagt 1740 
ccctgaaaga taacaaatga tattataaac tatgtatttc tgtagtaaat ttcccaaaaa 1800 
tacataattt tatgagcaaa atggtgtgga atatataaaa tattgtatgt atgtgtgtgt 1860 
atataagctt tttaagtgaa gttaagaaac atgaatgagg acctatgaag cattatttct 1920 
5 gtgcacgtac caatatcctg ttatttccag tgcccaaccc taagatctct gactctacca 1980 
cattaacttc agactaaaaa ctaacccttt taaacacacc gtaaatgatt tttttttgtt 2040 
aaatgcaact taaaaatttc aaaatatata atagtaaaaa tgatgtttaa tatattgtgt 2100 
tgaaatgtta aagaataaac tgtagtttga atgcatctga atattttgta ggcatgcaac 2160 
agctcttagt aagaaaccat tccatctttc tgagagaaag ttgtatatta actgaagaga 2220 
10 ggcatcttgt tttgatctgg aggcacaaga aatcttaaag tgtgaatccc attaggttat 2280 
aagataaaaa aaaatcctat cagagaatag gaaaataaaa accactagta atctcattac 2340 
tgcctcttca aatactactg ctatataggt gaacagcttc ccctctagtc tttcctcttt 2400 
cttcccaaaa tagccgacga gtggatcaag. catgtctagt gggatataag cacaaagtat 2460 
gcttattctt gttttctcct ctttttattt gttctgatta ttaatttttc ccctagtcat 2520 
15 agttcttttt ttggatgaaa tttagggcaa gtgttcaggc ttacctagtt acttcagatg 2580 
atgacatctc ctctaaacag taattaatat tttcagtcca tggtaatatt caacatttta 2640 
agctcattta aaatactaac tatgccccct gatatggttt gactctgtgt ctacacccaa 2700 
atctcatgtc aaattgtaat ccccacttat tgaaggtggg gactggtggg aggtgattgg 2760 
atcatagtcg ggggttccag tggtttagcg ccatcctccc tagggctgtc ttgtgataga 2820 
20 gttctcacca gatccgatga gatctgatgg tttcaaagtg tgtagcagtc tcagtctctc 2880 
tgtctgtctg tctgtctctc tctctgtctc tcaatcccct gctccaccat ggtaagatgt 2940 
gcttacttcc cctttatgtt ttgacacgat tgtaagtttc ctgaggcctc ccagccatgc 3000 
ttctgtacag cctgaggaac tgtgagtcaa ttaaatgtct tttcttcata aactacccag 3060 
tctcaggtag ttctttatag cagcatgaaa acagactaat acaccccctt tctcagcctg 3120 
25 tgcttgtggt aaggctagcc aatgcaagca gaaagagggg tgctgtctgc tgtttctcca 3180 
attggcatct gtgccccttc ttcatcttgt cagttatagt gggaggccca agacttactt 3240 
ttggcctcag acacagtcaa ggatcatcct gatgcaaaaa acaaaatcac ctttgaacct 3300 
cttcctccac accatcttcc actttggttt tccgctatgc ctttgtactg gggacaaaga 3360 
ctatccacga atccttactt taaaaactga ggcagtggct ctgaatcatt cccccttgga 3420 
30 cacccccaca catctgaaca ccctcgtgtc ccaacaccac atctgctgaa gttgactgga 3480 
cctgaaatgc tggaggctca gtgttttcgg aaactgcgga gagttctcct gccacaattc 3540 
tcttgttgtc cgtcaaaagc aattacatgt ttcagatgat gaaaagcact gtcatgaaaa 3600 
gctggcacag cagccaagga gacagatgta ttaaaccaca cggcttctca gaactcccac 3660 
ttacgttgga gctcagcaca gagatgagca tggtgatatc cccccgtcta acatcctccc 3720 
35 atttggattt aatgcatgag gctcgtttct gatgaggtgc caaattcttt cggtggacaa 3780 
acacaaagtt gcagaattca gtgatcatac aagtccatgt cctcaaacga aaataataat 3840 
actataaatg caaagatgct accataaaaa atacagtagg tggacttttg tgagtcaaca 3900 
taaaaaaatc cagcatttct aactggtagc atgaggagta ccagtctgca ggatgttaac 3960 
gcactgatag gctatggaca atatattcca cacttacgga agtggggaaa ctataggtca 4020 
40 gacaaagcta caggtacttc tttctttctt ttttcttttt ttctttctga gatggagtct 4080 ; 
ccctctgtcg cctaggctgg agtgcaatgg catgatctca gctcactgca acctccgcct 4140 
cccgggttca agtgattctc ctgcctcagc ctcctgagtc gctgggatta taggcgcaca 4200 
ccaccaacca cccggctaat ttttcatatt tttagtagaa acgaggtttc actgtgttag 4260 
ccagaatggt ctcgaactcc tgacctaggt gatctgcctg cctcagcctc ccaaagtgct 4320 
45 gggattacag gtgtgagcca cggcactcgg cccaagtact tcttaactgc aggacttctc 4380 
agaggctttt aatatggtaa tgagtgctgt gcagtgagat gactcaaatc aatatatagc 4440 
caacacgttt gatatgacca cagaacccat ttccctgaac actttttatc acctcctaga 4500 
gcaggggtca gaaaactact gcccatgtgg tctgatctgg ccactgcttg ttttcgtaaa 4560 
ttttcactgc ccacagccaa cctcatttgt gtacatgccg tccatggctg ctttcacact 4620 
50 gcagccgcag cgatgagtca tgagacacgc acccgtggct cacaaagtct aaaatattta 4680 
ctctttggcc ctttacggaa aacgtgtaca aactttgtcc tctgaaatat agtttgggaa 4740 
cctacttcac tcatgatatc aataattcta aaaaatttca aacaattttt aaagtcaaca 4800 
ttgagaagtt ttccatggtt agaatttaat aattacagtc aataactctc ttaaagcatc 4860 
gtgacaataa agtagaagga cactgtgtaa ttctatatgg tctctgtctt cagggagctc 4920 
55 ccaagctagt ttgtgaaaaa gagaactcaa ggcacaaaca actggggggc aagagatgtt 4980 
ctaaacacta agaggaggca atatggcata gtagaaagaa tgcaagcatg tgtaagagta 5040 
aaacatacac agggcgcaga gtcacaagag tcaggggttg gtcttggctg tgataccttc 5100 
tcctgcgata agagtaagac ctgctttatt gagcacttac tatgtgccag gcaccatccc 5160 
agttgctttc catgaaggat tgctctttaa ttcttatcat aatgaatagg tactattatt 5220 
60 atcccagttt atgggtgagg aaaccaagtc ttaggttgac taacttatta aaataaagca 5280 
catacctagt aggtggtgaa caagagccaa aacaagtctg ttggtcagct aaccaccacg 5340 
ttatagtggc tgctgcttag gggcagcata tatcctctgc atttccccag aggtgacaga 5400 
agtagtggag agggtagtgc tctctaggct ctgacacaca gcccatgttt gctgactcaa 5460 
caaccacaca acgtaatgct atggcacctc aggtggcagg gagggtatcg tgggctggac 5520 
65 tgatcttcag aaggctgcat ctgagcaggg tgtggaagga tggcagaagg aagaggggag 5580 
tgaggcactc agggtagaca gaaggtgata ctgccacatc ctccactcag aaggaatctg 5640 
gaaggttttg acaaatatgt acattttgga gtgggaacag cagaaagtag gggctggcta 5700 
cctactttcc actaccccta tccccactcc ccctaggggg tgaggcagag ataagcacgc 5760 
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tttgtggggc acactgatcc acaaacttgc aagcctgagc tccactcagc agcagcctgg 

gcaggatgaa tggtggtgtg caggtgtttg cggcatgcat agcttctctt tctcctagag 

tcaatccagt gatcccctgc gaaaagaatc ctccacttga cctgaatgct ctgtattggc 

accttctgga caggaatcct atcgggtatt tcatcaaagc atcccacagc agttttaagt 

5 gtggctccat gagagtaaga caccaggaca gctgtgccac ctagtgatgc ttgcctgtac 

tgtctgcatc ccagagagtg ggagaggctt cggaaaggac tggtatttgc tatttttggc 

caacacttct acatgcaaag gaatctgaag tctccgcctg ggcttcaaaa catttcttgt 

ggcctcactg accattcccg tgagcctcag gggacatgag tgtcttcaag ctaagaggga 

gcagtccaac tctgcagcac gaattaccaa ggaacat 

10 

<210> 12 
<211> 1829 
<212> DNA 

15 <213> Homo sapiens 

<400> 12 

ttttgccttg gttttcccag ctattaaaag ggagatatca tcattcattc aacaagcatt 60 
tcttgcggac ccattttcca cgacagaacc atccaaacca agacacttct tcctattgtg 120 

20 gcaaaaatga gcaacgttaa aactccccag aatgcagagg tgactcacgt gtttatgtaa 180 
gtaagcattt agatatggct ttggaaaaga aaacacacac ctaaaagagg ctaccagcat 240 
ttgaataaag tgtgtaacct gggtgttcaa aactgtacaa cagtttctgt gtactttaag 300 
tcactatcct tcatacttct tccctttaaa ttctatttga aactaaataa tatcaacaat 360 
actttaagta agattatgct ggtgttactg atgaaattat atgcaggtct gcttggcctg 420 

25 ttctcatgcg taagcctttg ataacttgtt cctaggttat ccagttctgc ttacctgagt 480 
acccacttta tcccagaaac gagacctgaa cactgtgtcc tggtttcatc tttccagtga 540 
gccagccttc tgaggcatta gcggctcagc aagaccatgg aggtgagaga ggcacaggca 600 
cactccagat gcagggtctc tgccctctgt caagagtcac acggcagaga ggacaagact 660 
cctgtcccgg gggaactgtt tttccttttg ccccacaccc cagatatctt cgtgacagcc 720 

30 cctgccctga aagcccggtg agcgtcatgg cggagaagcc agcctgtttc cgattattct 780 
gcctgacagg gactatctgc tgctgtgtgc agttgtagaa atgtcataag ctcaggactc . 840 
aggacacttt gacgtcgagt ccaacaggac ccatagagtc cccctgaaca attctcatct 900 
gtgtgggaag cctaaggggt ggttttaagt aggggctggc aggagagtga ggtctgggct 960 
gcttctttac ggctcttcaa gtgccggcca tttcctggga ttcccaggag taagtcttga 1020 

35 ctcccatgaa tcccactgac ccctaagtgg catgggaaag gagaggacaa aaagaaagga 1080 
gttagacatt tgactaatac tgttcagagt tgaaatgcgc tctttgagca ggtaggcatt il40 
taaagctaac ttt tec teat gggatacact tgtgggttct ttggagactg atctgcgtga 1200 
ccccaatggc tcaacgagaa tcttctccta caattccctt gactaggtaa tgccctcact 1260 
ctcctgctat cttcttctcc ctggttctca gtgtttcact gecacataaa ctctctctct 1320 

40 cactgttggg gtcccattgc ccctctgagt ggaattcctg ggcaaaagat teaatgeaag 1380 
ctcaccaggc tatctcacat tgggectaca tctggtctcc tggaaatatg cccatatcct 1440 
ttgtctggac aatcctgaga cagacccagc ecaatgeaga agtgcctatt cttctctcag 1500 
cagcagatta acttgtctgg gagttactca tcctcagact tctcaagctc attccgacac 1560 
ctgtccacct aatgcagcaa ctgcacgtga cacaataaac acttaatgea agtcacaaat 1620 

45 gggagtcaca tatataatat tattagtagc caegctagaa aaatgagaca gaaacagata 1680 
aagttaattt caatttcagt aacatttcat ttaatctaat ctgtcaaaaa tactattgtc 1740 
tcaatatata atcaacaaaa ctgttaatga aatacattac agtttttttg tattaagtct 1800 
ttgaaatgga gtgtgtattt tagacttaa 1829 

,50 

<210> 13 

<211> 2612 

<212> DNA 

<213> Homo sapiens 

55 

<400> 13 

atgaggaaag gtctgeggge gacagcggcc cgctgcggac tgggactggg atacttgetg 60 
caaatgeteg tgctacctgc cctggccctg ctcagcgcca gcggcactgg ctccgccgcc 120 
caagatgatg acttttttca tgaactccca gaaacttttc cttctgatcc acctgagcct 180 

60 ctgccacatt tccttattga gectgaagaa gcttatattg tgaagaataa gcctgtgaac 240 
ctgtactgta aagcaagccc tgccacccag atctatttca agtgtaatag tgaatgggtt 300 
catcagaagg accacatagt agatgaaaga gtagatgaaa cttccggtct cattgtcegg 360 
gaagtgagca ttgagatttc gcgccagcaa gtggaagaac tctttggacc tgaagattac 420 
tggtgccagt gtgtggcctg gagctccgcg ggtaccacaa agagceggaa ggcgtatgtg 480 

65 egcattgeat atetaeggaa gacatttgag caggaacccc taggaaagga agtgtctttg 540 
gaacaggaag tcttactcca gtgtcgacca cctgaaggga tcccagtggc tgaggtggaa 600 
tggttgaaaa atgaagacat aattgatccc gttgaagatc ggaattttta tattactatt 660 
gatcacaacc tcatcataaa gcaggcccga ctctctgata ctgeaaatta cacctgtgtt 720 



5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6277 
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gccaaaaaca ttgttgccaa gaggaaaagt acaactgcca ctgtcatagt ctatgtcaac 780 
ggtggctggt ccacctggac ggagtggtct gtgtgtaaca gccgctgtgg acgagggtat 840 
cagaaacgta caaggacttg taccaacccg gcaccactca atgggggtgc cttctgtgaa 900 
gggcagagtg tgcagaaaat agcctgtact acgttatgcc cagtggatgg caggtggacg 960 
5 ccatggagca agtggtctac ttgtggaact gagtgcaccc actggcgcag gagggagtgc. 1020 
acggcgccag cccccaagaa tggaggcaag gactgcgacg gcctcgtctt gcaatccaag 1080 
aactgcactg atgggctttg catgcagact gctcctgatt cagatgatgt tgctctctat 1140 
gttgggattg tgatagcagt gatcgtttgc ctggcgatct ctgtagttgt ggccttgttt 1200 
gtgtatcgga agaatcatcg tgactttgag tcagatatta ttgactcttc ggcactcaat 1260 

10 gggggctttc agcctgtgaa catcaaggca gcaagacaag atctgctggc tgtaccccca 1320 
gacctcacgt cagctgcagc catgtacaga ggacctgtct atgccctgca tgacgtc tea . 1380 
gacaaaatcc caatgaccaa ctctccaatt ctggatccac tgcccaacct gaaaatcaaa 1440 
gtgtacaaca cctcaggtgc tgtcaccccc caagatgacc tctctgagtt tacgtccaag 1500 
ctgtcccctc agatgaccca gtcgttgttg gagaatgaag ccctcagcct gaagaaccag 1560 

15 agtctagcaa ggcagactga tccatcctgt acegcatttg gcagcttcaa ctcactggga 1620 
ggtcacctta ttgttcccaa ttcaggagtc agettgetga ttcccgctgg ggccattccc 1680 
. caagggagag tctacgaaat gtatgtgact gtacacagga . aagaaactat gaggccaccc 1740 
atggatgact ctcagacact tttgacccct gtggtgagct gtgggccccc aggagctctg 1800 
ctcacccgcc ccgtcgtcct cactatgeat cactgcgcag accccaatac cgaggactgg 1860 

20 aaaatactgc tcctcgctgg agtacagcat ccgagtctac tgtctggatg acacccagga 1920 
tgccctgaag gaaattttac atcttgagag acagaeggga ggacagctcc tagaagaacc 1980 
taaggctctt cattttaaag gcagcaccca caacctgcgc ctgtcaattc aegatatege 2040 
ccattccctc tggaagagca aattgctggc taaatatcag gaaattccat tttaccatgt 2100 
ttggagtgga tctcaaagaa acctgcactg caccttcact ctggaaagat ttagcctgaa 2160 

25 cacagtggag ctggtttgca aactctgtgt gcggcaggtg gaaggagaag ggcagatctt 2220 
ccagctcaac tgcaccgtgt cagaggaacc tactggcatc gatttgeegc. tgctggatcc 2280 
tgcgaacacc atcaccacgg tcacggggcc cagtgettte agcatccctc tccctatccg 2340 
gcagaagctc tgtagcagee tggatgcccc ccagacgaga ggecatgact ggaggatget 2400 
ggcccataag ctgaacctgg acaggtactt gaattacttt gccaccaaat ccagcccaac 2460 

30 tggegtaate ctggatcttt gggaagcaca gaacttccca gatggaaacc tgagcatget 2520 
ggcagctgtc ttggaagaaa tgggaagaca tgaaacggrtg gtgtccttag cagcagaagg 2580 
gcagtattaa ccaccatgct ggaaggggaa at 2612 

35 <210> 14 
<211> 636 
<212> PRT 
<213> Homo sapiens 

40 <400> 14 

Met Arg Lys Gly Leu Arg Ala Thr Ala Ala Arg Cys Gly Leu Gly Leu 
1 5 10 15 

Gly Tyr Leu Leu Gin Met Leu Val Leu Pro Ala Leu Ala Leu Leu Ser 
45 20 25 30 

Ala Ser Gly Thr Gly Ser Ala Ala Gin Asp Asp Asp Phe Phe His Glu 
35 40 45 

50 Leu Pro Glu Thr Phe Pro Ser Asp Pro Pro Glu Pro Leu Pro His Phe 
50 55 60 

Leu He Glu Pro Glu Glu Ala Tyr lie. Val Lys Asn Lys Pro Val Asn 
65 70 75 80 

55 

Leu Tyr Cys Lys Ala Ser Pro Ala Thr Gin He Tyr Phe Lys Cys Asn 
85 90 95 

Ser Glu Trp Val His Gin Lys Asp His He Val Asp Glu Arg Val Asp 
60 100 105 110 

Glu Thr Ser Gly Leu He Val Arg Glu Val Ser He Glu He Ser Arg 
115 120 125 

65 Gin Gin Val Glu Glu Leu Phe Gly Pro Glu Asp Tyr Trp Cys Gin Cys 
130 135 140 

Val Ala Trp Ser Ser Ala Gly Thr Thr Lys Ser Arg Lys Ala Tyr Val 
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145 



150 



155 



160 



Arg lie Ala Tyr Leu Arg Lys Thr Phe Glu Gin Glu Pro Leu Gly Lys 
165 170 175 

5 

Glu Val Ser Leu Glu Gin Glu Val Leu Leu Gin Cys Arg Pro Pro Glu 
180 . 185 190 

Gly lie Pro Val Ala Glu Val Glu Trp Leu Lys Asn Glu Asp lie lie 
10 195 200 205 

Asp Pro Val Glu Asp Arg Asn Phe Tyr He Thr He Asp His Asn Leu 
210 215 220 

15 He He Lys Gin Ala Arg Leu Ser Asp Thr Ala Asn Tyr Thr Cys Val 
225 230 235 240 

Ala Lys Asn He Val Ala Lys Arg Lys Ser Thr Thr Ala Thr Val He 
245 250 255 

20 . 

Val Tyr Val Asn Gly Gly Trp Ser Thr Trp Thr Glu Trp Ser Val Cys 
260 265 270 

Asn Ser Arg Cys Gly Arg Gly Tyr Gin Lys Arg Thr Arg Thr Cys Thr 
25 275 . 280 285 

Asn Pro Ala Pro Leu Asn Gly Gly Ala Phe Cys Glu Gly Gin Ser Val 
290 295 300 

30 Gin Lys He Ala Cys Thr Thr Leu Cys Pro Val Asp Gly Arg Trp Thr 
305 .310 315 320 



35 



Pro Trp Ser Lys Trp Ser Thr Cys Gly Thr Glu Cys Thr His Trp Arg 
325 . 330 335 

Arg Arg Glu Cys Thr Ala Pro Ala Pro Lys Asn Gly Gly Lys Asp Cys 
340 345 350 



Asp Gly Leu Val Leu Gin Ser Lys Asn Cys Thr Asp Gly Leu Cys Met 

40 355 360 365 

Gin Thr Ala Pro Asp Ser Asp Asp Val Ala Leu Tyr Val Gly He Val 

370 375 380 

45 He Ala Val He Val Cys Leu Ala He Ser Val Val Val Ala Leu Phe 

385 390 395 400 



50 



Val Tyr Arg Lys Asn His Arg Asp Phe Glu Ser Asp He He Asp Ser 
405 " 410 415 

Ser Ala Leu Asn Gly Gly Phe Gin Pro Val Asn He Lys Ala Ala Arg 
420 425 430 



Gin Asp Leu Leu Ala Val Pro Pro Asp Leu Thr Ser Ala Ala Ala Met 
55 435 440 445 

Tyr Arg Gly Pro Val Tyr Ala Leu His Asp Val Ser Asp Lys He Pro 
450 455 460 

60 Met Thr Asn Ser Pro He Leu Asp Pro Leu Pro Asn Leu Lys lie Lys 
465 470 475. 480 



65 



Val Tyr Asn Thr Ser Gly Ala Val Thr Pro Gin Asp Asp Leu Ser Glu 
485 490 495 

Phe Thr Ser Lys Leu Ser Pro Gin Met Thr Gin Ser Leu Leu Glu Asn 
500 505. 510 
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Glu Ala Leu Ser Leu Lys Asn Gin Ser Leu Ala Arg Gin Thr Asp Pro 
515- 520 .525 

Ser Cys Thr Ala Phe Gly Ser Phe Asn Ser Leu Gly Gly His Leu He 
5 .530 535 540 

Val Pro Asn Ser Gly Val Ser Leu Leu He Pro Ala Gly Ala He Pro 
545 550 555 560 

10 Gin Gly Arg Val Tyr Glu Met Tyr Val Thr Val His Arg Lys Glu Thr 
565 570 575 

Met Arg Pro Pro Met Asp Asp Ser Gin Thr Leu Leu Thr Pro Val Val 

580 585 . . . 590 

15 

Ser Cys Gly Pro Pro Gly Ala Leu Leu Thr Arg Pro Val Val Leu Thr 
595 600 605 

Met His His Cys Ala Asp Pro Asn Thr Glu Asp Trp Lys He Leu Leu 
20 610 615 620 

Leu Ala Gly Val Gin His Pro Ser Leu Leu Ser Gly 
625 630 635 

25 

<210> 15 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
30 . 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
MF402L11I 

35 <400> 15 

atggagactt cgctcactgg tcctcttt 28 



<210> 16 
40 <211> 28 . 
<212> DNA 

<213> Artificial Sequence 
<220> 

45 <223> Description of Artificial Sequence : Reverse Primer 
MF402L11I ■ ' 



28 



<400> 16 

cctggaggca cataaaccct agacatcc 

50 

<210> 17 
<211> 21 
<212> DNA 
55 <213>. Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
MF402L11II 

60 

<400> 17 

acatcctgag gtaggtggtc c 21 



65 <210> 18 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Reverse Primer 
MF402L11II 

5 

<400> 18 

gggtctcact cattctgttg c 



10 <210> 19 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
15 <220> 

<223> Description of Artificial Sequence: Forward Primer 
MF402L11III 

<400> 19 
20 ctgtattttg actcaattg.t g 



<210> 20. 

<2li> 21 

25 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse 
30 Primer MF402L11III 

<400> 20. 

ccaaaagcct ctccttatta c 

35 

<210> 21 
. <211> 28 
<212> DNA 

<213> Artificial Sequence 

40 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
MF402L11IV 

45 <400> 21 

gaatacagac acacagagac agcacctt. 



<210> 22 
50 <211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

55 <223> Description of Artificial Sequence: Reverse Primer 
MF402L11IV 

<400> 22 

gttccttgct ccagtgaaag acataatc 

60 

<210> 23 
<211> 28 
<212> DNA 
65 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
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MF402L11V 
<400> 23 

ctaacaccag ttcagtggaa cccattgc 



<210> 24 
<211> 28 
<212> DNA 
10 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
MF402L11V 

15 

<400> 24 

atttgatgtt catacagggt tgggggac 



20 <210> 25 
<211>. 28 
<212> DNA 

<213> Artificial Sequence 
25 <220> 

<223> Description of Artificial Sequence: Forward Primer 
MF402I,11VT ' 

<400> 25 

30 ctccacttgg actctgatga caccactg 



<210> 26 
<211> 27 
35 <212> DNA 

<213> Artificial Sequence 

• <220> 

<223> Description of Artificial Sequence: Reverse Primer 
40 MF402L11VI 

<400> 26 

catagttcac tgcagcctca acctcct 

45 

<210> 27 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

50 

<220> . 

<223> Description of Artificial Sequence: Forward Primer 
MF90A19I 

55 <400> 27 

taattatgat tagacagacc gtcctcca 



<210> 28 
60 <211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

65 <223> Description of Artificial Sequence: Reverse Primer 
MF90A19I 



<400> 28. 



<210> 30 
<211> 28 
<212> DNA 
20 <213> Artificial Sequence 



<220> 



25 



<223> ^9m9li° n ° f MUU ^ sequence: Reverse Primer 



<400> 30 

gatggtgtgg aggaagaggt tcaaaggt 



30 <210> 31 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
35 <220> 



40 



<223> SSSS^T ° f Artificial sequence: Forward Primer 
<400> 31 

aactgagaac tggaaggtag ctttggga 



<210> 32 
<211> 28 
45 <212> DNA 

<213> Artificial Sequence 

<220> 

50 <223> KSSSr ° f tatifici ^ seance: Reverse Primer 
<400> 32 

acaagagcag tcagaactat ttggccct 

<210> 33 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> SSSSSF ° f Artificial Se *— ^rwara Primer 



55 



60 



65 <400> 33 

atttgctggc tccgacagta agtattcc 
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, <210> 34 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

5 

<220> 

<223> Description of Artificial Sequence: Reverse . Primer 
MF90A19IV 

10 <400> 34 

atgcagtgat gcttaacacc ctgtctct 



<210> 35 
15 <211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

20 <223> Description of Artificial. Sequence: Forward Primer 
MF90A19V 

<400> 35 . 

cctgctctgt ctgccttgga aaagaaat 

25 

<210> 36 
<211> 28 
<212> DNA 
30 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
MF90A19V 

35 

<400> 36 

tcttggccca gacggtataa gagtccat 



40 <210> 37 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
45 <220> 

<223> Description of Artificial Sequence: Forward Primer 
MF90A19VT . . 

<400> 37 

50 agtgcaccag taagcatgcc ttccaata 



<210> 38 
<211> 28 . 
55 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
60 MF90A19VT 

<400> 38 

aagctggcaa ggagattctc aaaagtgg 

65 

<210> 39 
<211> 28 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Forward Primer 
5 MF90A19VII 

<400> 39 

accacatctg ctgaagttga ctggacct 

10 

<210> 40 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

15 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
MF90A19VII 

20 <400> 40 

tctgggcaac atagtgaaac cctgtctc 



<210> 41 
25 <211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

30 <223> Description of Artificial Sequence: Forward Primer 
MF90A19VIII 

<400> 41 

cttgaaccta ggagacggag gttgcagt 



<210> 42 
<211> 28 
<212> DNA 
40 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
MF90A19VIII 

45 

<400> 42 

aagcatatct ctgaccctct ggCcattc 



50 <210> 43 
<211>. 28 
<212> DNA 

<213> Artificial Sequence 
55 <220> 

<223> Description of Artificial Sequence: Forward Primer 
MF90A19IX 

<400> 43 

60 gaagctgcaa tcccatcctt gtctcttt 



<210> 44 
<211> 28 
65 <212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: Reverse Primer 
MF90A19IX 

<400> 44 

5 tggtggtgca cacctgtaat accagcta 

<210> 45 
<211> 20 
10 <212> DNA 

<213> Artificial Sequence 

. <2.20> 

<223> Description of Artificial Sequence: Forward Primer 
15 C4P3F 

<400> 45 

aggatgtcca tggttttgga 

20 

<210> 46 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

25 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
C4P3R 

30 <400> 46 

ctgtttccag cttccagagg 



<210> 47 
35 <211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

40 <223> Description of Artificial Sequence: Forward Primer 
C4P4F 

<400> 47 

catggtccca aagctctgtt 

45 

<210> 48 
<211> 20 
<212> DNA 
50 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
- C4P4R 

55 

<400> 48 

tgacggaatt tcagtgcaac 



60 <210> 49 
• <211> 20 
<212> DNA 

<213> Artificial Sequence 
65 <220> 

<223> Description of Artificial Sequence: Forward Primer 
C4P7F 
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<400> 49 

tctgatacag gggatttggc 



5 <210> 50 
' <211> 20 
<212> DNA 

<213> Artificial Sequence 
10 <220> 

<223> Description of Artificial Sequence: Reverse Primer 
C4P7R 

<400> 50 
15 ttcaggtggg tgatgtgtgt 



<210> 51 
<211> 20 
20 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
25 C4P8F 

<400> 51 

aggaaagagg ggtgctgtct 

30 

<210> 52 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

35 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
C4P8R 

40 <400> 52 

atagcaggag agtgagggca 



<210> 53 
45 <211> 20 
<212> DNA 

<213> Artificial Sequence 



<220>. 

50 <223> Description of Artificial Sequence: Forward Primer 
C4P9F 

<400> 53 
^ ggcccacttt ctggttcata 

<210> 54 
<211> 20 
<212> DNA 
60 <213> Artificial Sequence 

' <220> 

<223> Description of Artificial Sequence: Reverse Primer 
C4P9R 

65 

<400> 54 

tcagccagac cagtgtgaaa 
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<210> 55 
<211> 20 
<212> DNA 
5 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
C15P1F 

10 

<400> 55 

ggtctctgcc ctctgtcaag 



15 <210> 56 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
20 <220> 

<223> Description of Artificial Sequence: Reverse Primer 
C15P1R 

<400> 56 
25 aggcttccca cacagatgag 



<210> 57 
<211> 20 
30 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
35 C15P2F 

<400> 57 

cactcttggc acccttaacc 

40 . 

<210> 58 
<2X1> 20 
<212> DNA 

<213>' Artificial Sequence 

45 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
C15P2R 

50 <400> 58 

tttgttctgt cctccccaac 



<210> 59 
55 <211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

60 <223> Description of Artificial Sequence: Forward Primer 
C15P5F 

<400> 59 

tttcatcaaa gcatcccaca 

65 

<210> 60 
<211> 20 
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<212> DNA 

<213>. Artificial Sequence 
<220> 

5 <223> Description of Artificial Sequence: Reverse Primer 
C15P5R 

<400> 60 

tggtaattcg tgctgcagag 

10 

<210> 61 
<211> 20 
<212> DNA 
15 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
C15P6F 

20 

<400> 61 

cccaaacatt ccaacttcgt 



25 <210> 62 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
30 <220> 

<223> Description of Artificial Sequence: Reverse Primer 
C15P6R 

<400> 62 
35 tagctcaagc ctgaacccat 



<210> 63 
<211> 26 
40 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
45 BPC4F 

<400> 63 

accctaatgc ccatttttgg ttgagg 

50 

<210> 64 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

55 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
BPC15R 

60 <400> 64 

aaataccagt cctttccgaa gcctctcc 



<210> 65 
65 <211> 28 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Forward Primer 
BPC15F 

5 <400> 65 

cgacagaacc atccaaacca agacactt 



<210> 66 
10 <211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

15 <223> Description of Artificial Sequence: Reverse Primer 
BPC4R 

<400> 66 

gtgtttattg tgtcacgtgc agttgctg 

20 

• <210> 67 
<211> 27 
<212> DNA 
25 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 
RORal 5' UTR 

30 

<400> 67 

ctgaaaacag aagatagagg gagtctc 



35 <210> 68 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
40 <220> 

<223> Description of Artificial Sequence: Reverse Primer 
UNC5c 3'UTR 

<400> 68 
45 atttcccctt ccagcatggt 



<210> 69 
<211> 27 
50 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 

55 api 

<400> 69 

ccatcctaat acgactcact atagggc 

60 

<210> 70 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

65 

<220> 

<223> Description of Artificial Sequence: Reverse Primer 
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- 31- 

<400> 70 

ctccacagat cttgcatgga at 

5 <210> 71 
<211> 23. ' 
<212> DNA 

<213> Artificial Sequence 
10 <220> 

<223> Description of Artificial Sequence: Forward Primer 
AP2 

<400>, 71 
15. actcactata gggctcgagc ggc 

<210> 72 
<211> 23 
20 <212> DNA 

<213> Artificial Sequence 

<220> . 

<223> Description of Artificial Sequence: Forward Primer 

25 

<400> 72 

tctgtgtttc atacccagga ggt 

30 <210> 73 
<211> 23 
<212> DNA 

<213>. Artificial Sequence 
35 <220> 

<223> Description of Artificial Sequence: Reverse Primer 
<400> 73 

agaacacaca cccatagact gca 

40 

<210> 74 •■ 
<211> 19 
<212> DNA 
45 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Forward Primer 

50 <400> 74 

cctgtaccgc atttggcag 



<210> 75 
55 <211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

.60 <223> Description of Artificial Sequence: Reverse Primer 
<400> 75 

tgcaaaccag ctccactgtg 

65 



